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DISEASE RELATED PROTEIN NETWORK 

The present invention relates to a method for generating a network of direct and 
indirect interaction partners of a disease-related (poly)peptide comprising the steps of 
(a) contacting a selection of (poly)peptides suspected to contain one or several of 
said direct or indirect interaction partners with said disease-related (poly)peptides 
and optionally with known direct or indirect interaction partners of said disease- 
related (poly)peptide under conditions that allow the interaction between interaction , 
partners to occur; (b) detecting (poly)peptides that interact with said disease-related 
(poly)peptide or with said known direct or indirect interaction partners of said 
disease-related (poly)peptide; (c) contacting (polypeptides detected in step (b) with a 
selection of (poly)peptides suspected to contain one or several (polypeptides 
interacting with said (polypeptides detected in step (b) under conditions that allow 
the interaction between interaction partners to occur; (d) detecting proteins that 
interact with said (poly)peptides detected in step (b); (e) contacting said disease- 
related (poly)peptide and optionally said known direct or indirect interaction partners 
of said disease-related (poly)peptide, said (poly)peptides detected in steps (b) and 
(d) and a selection of proteins suspected to contain one or several (poly)peptides 
interacting with any of the. afore mentioned (polypeptides under conditions that allow 
the interaction between interaction partners to occur; (f) detecting (poly)peptides that 
interact with said disease-related (polypeptide and optionally said known direct or 
indirect interaction partners of said disease-related (poly)peptide or with said 
(poly)peptides identified in step (b) or (d); and (g) generating a (polypeptide - 
(poly)peptide interaction network of said disease-related (poly)peptide and optionally 
said known direct or indirect interaction partners of said disease-related (poly)peptide 
and said (poly)peptides identified in steps (b), (d) and (f). Moreover, the present 
invention relates to a protein complex comprising at least two proteins and to 
methods for identifying compounds interfering with an interaction of said proteins. 
Finally, the present invention relates to a pharmaceutical composition and to the use 
of compounds identified by the present invention for the preparation of a 
pharmaceutical composition for the treatment of Huntington's disease. 
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Several documents are cited throughout the text of this specification. The disclosure 
content of the documents cited herein (including any manufacture's specifications, 
instructions, etc.) is herewith incorporated by reference. The present invention is 
based on scientific experiments which have been performed on biological specimen 
derived from diseased patients. Patients have given their consent to use the 
specimen for the study which is disclosed in the present invention. In case of 
deceased patients, the consent has been given by a relative. 

With the identification of >35.000 genes in the human genome the challenge arises to 
assign biological function to all proteins and to link these proteins to physiological 
pathways and disease processes. Since protein-protein interactions play a role in 
most events in a cell, clues to the function of an unknown protein can be obtained by 
investigating its interaction with other proteins whose function are already known. 
Thus, if the function of one protein is known, the function of the binding parners can 
be infered (deduced). This allows the researcher to assign a biological function to 
uncharacterized proteins by identifying protein-protein interactions. For example, 
several so far uncharacterized proteins in Caenorhabditis elegans were identified in a 
yeast two-hybrid screen for eukaryotic 26S proteasome interacting proteins and 
thereby could be linked to the ubiquitin-proteasome proteolytic pathway (Vidal et al., 
2001). Elucidation of protein-protein interactions is particularly desired when it comes 
to the generation of new drugs. For many diseases, the available drug portfolio is 
insufficient or inappropriate to provide a cure or to prevent onset of the disease. One 
such disease is Huntington's disease. 

Huntington's disease (HD) is a neurodegenerative disorder caused by an expanded 
polyglutamine (polyQ) tract in the multidomain protein huntingtin (htt). The elongated 
polyQ sequence is believed to confer a toxic gain of function to htt. It leads to htt 
aggregation primarily in neurons of the striatum and cortex and subsequently to the 
appearance of the disease phenotype. However, there is experimental evidence that 
loss of htt function may also contribute to HD pathogenesis. Since huntingtin 
aggregation correlates with disease progression, it is crucial to develop methods for 
identifying factors that promote or inhibit aggregation of huntingtin. 
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Previously, a number of single interaction partners of huntingtin had been reported. 
In light of these reports, it is tempting to speculate that huntingtin is bound into a 
larger network of interacting partners, many of which might be capable of modulating 
huntingtin's activity and function by direct or indirect interaction. It is likely that an 
aberrant interaction of huntingtin with some of the members of said network will 
impair huntingtin's normal function. Moreover, this interaction might also be relevant 
for the conformation of huntingtin or for its solubility or state of aggregation. 
Interfering with the direct or indirect interactions of the protein-protein interaction 
network will provide an excellent basis for therapeutic intervention as it will allow to 
modulate huntingtin's activity or state of aggregation or both. The state of the art so 
far did not provide compounds capable of reducing or suppressing huntingtin 
aggregation since the factors promoting or suppressing huntingtin aggregation were 
not known. 

Thus, the technical problem underlying the present invention was to provide novel 
approaches for identifying direct or indirect interaction partners of disease-related 
proteins, which must be seen as new targets for drug development. The solution to 
this technical problem is achieved by providing the embodiments characterized in the 
claims. 

Accordingly, the present invention relates to a method for generating a network of 
direct and indirect interaction partners of a disease-related (polypeptide comprising 
the steps of (a) contacting a selection of (poly)peptides suspected to contain one or 
several of said direct or indirect interaction partners with said disease-related 
(poly)peptides and optionally with known direct or indirect interaction partners of said 
disease-related (polypeptide under conditions that allow the interaction between 
interaction partners to occur; (b) detecting (poly)peptides that interact with said 
disease-related (polypeptide or with said known direct or indirect interaction partners 
of said disease-related (poly)peptide;(c) contacting (poly)peptides detected in step 
(b) with a selection of (polypeptides suspected to contain one or several 
(poly)peptides interacting with said (polypeptides detected in step (b) under 
conditions that allow the interaction between interaction partners to occur; (d) 
detecting proteins that interact with said (polypeptides detected in step (b); (e) 
contacting said disease-related (polypeptide and optionally said known direct or 
indirect interaction partners of said disease-related (polypeptide, said (polypeptides 
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detected in steps (b) and (d) and a selection of proteins suspected to contain one or 
several (poly)peptides interacting with any of the afore mentioned (poly) peptides 
under conditions that allow the interaction between interaction partners to occur; (f) 
detecting (poly)peptides that interact with said disease-related (polypeptide and 
optionally said known direct or indirect interaction partners, of said disease-related 
(polypeptide or with said (polypeptides identified in step (b) or (d); and (g) 
generating a (poly)peptide-(poly)peptide interaction network of said disease-related 
(poly)peptide and optionally said known direct or indirect interaction partners of said 
disease-related (poly)peptide and said (polypeptides identified in steps (b), (d) and 

(f). 

In accordance with the present invention, the term "direct and indirect interaction 
partners" relates to (poly) peptides that either directly interact with the disease-related 
(polyjpeptide (direct interaction) or that interact via a protein binding to/interacting 
with said disease-related (polypeptide. In the letter case, there is no direct contact 
between the direct interaction partner and the disease-related protein. Rather, a 
further protein forms a "bridge" between these two proteins. 

The term "known direct or indirect interaction partners" refers to the fact that for 
certain disease-related (polypeptides, such interaction partners are known in the art. 
If such interaction partners are known in the art, it is advantageous to include them 
into the method of the invention. If no such interactions partners are known in the art, 
then the network may be generated starting solely from the known disease-related 
(polypeptide. 

The term "conditions that allow the interaction between interaction partners to occur" 
relates to conditions that would, as a rule, resemble physiological conditions. 
Conditions that allow protein actions are well known in the art and, can be taken, for 
example from Golemis, E.A. Ed., Protein-Protein Interactions, Cold Spring Harbor 
Laboratory Press,. 2002. 

The term "suspected to contain one or more of said direct or indirect interaction 
partners" relates to the fact that normally, a selection of (polypeptides would be 
employed where the person skilled in the art would expect that interaction partners 
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are present. Examples of such selections of (poly)peptides are libraries of human 
origin such as cDNA libraries or genomic libraries. 

The term "detecting proteins" refers to the fact that the (polypeptides interacting with 
the "bait" (poly)peptides are identified within the selection of (polypeptides. A further 
characterization or isolation of the "prey" (polypeptides at this stage may be 
advantageous but is not necessary. The term "detecting (poly)peptides" preferably 
also comprises characterizing said (polypeptides or the nucleic acid molecules 
encoding said (poly)peptides. The skilled person knows that this can be done by a 
number of techniques, some of which are described for example in Sambrook et al., 
"Molecular Cloning, A Laboratory Manual"; CSH Press, Cold Spring Harbor, 1989 or 
Higgins and Hames (eds.). For example, the nucleotide sequence may be 
determined by DNA Sequencing, including PCR-Sequencing (see for example Mullis 
K, Faloona F, Scharf S, Saiki R, Horn G, Erlich H., Cold Spring Harb Symp Quant 
Biol. 1986;51 Pt 1:263-73). Alternatively, the amino acid sequence of said 
(polypeptide may be determined. The. skilled artesian knows various methods for 
sequencing proteins which include the method of Edman degradation, which is a 
preferred method of the present invention of determining the amino acid sequence of 
a protein. However, the amino acid sequence of a protein or (poly)peptide can also 
be reliably determined by methods such as for example Maldi-Tof, optionally in 
combination with the method of Edman degradation. The interaction partner may be 
identified either as fusion with a DNA binding domain or as fusion with an activation 
domain. Preferably, if an interaction partner has been identified as a fusion molecule 
comprising a DNA binding domain, the interaction partner is cloned into a vector 
allowing the expression of the interaction partner as a fusion with an activation 
domain. Consequently, protein interaction can be tested in the context the DNA 
activation or the DNA binding domain. 

In accordance with the present invention, the first round of detecting (poly)peptides 
that interact with the "bait" (poly)peptides recited in step (a) wherein the detected 
(polypeptides be considered as "prey" (polypeptides is followed by the second 
round of detecting further interacting (poly)peptides wherein the former "prey" 
(polypeptides are now used as "bait" (polypeptides. In certain preferred 
embodiments of the present invention such as in a two-hybrid detection system, a re- 
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cloning of the former "prey" (polypeptides into vectors that are suitable for 
expressing "bait" (polypeptides may be desired. 

Accordingly, the invention describes a novel strategy to identify protein-protein 
interaction networks for human disease proteins. This strategy was applied to detect 
pair-wise protein-protein interactions for Huntington's disease and is useful for other 
hereditary diseases as well. Several human hereditary diseases are summarized in 
table 5. 

A crucial step of the method of the invention is step (e). Here, the disease-related 
(polypeptide and optionally said known direct or indirect interaction partners of said 
disease-related (polypeptide are contacted under appropriate conditions, preferably 
at the same time, with both the (poly)peptides identified in steps (b) and (d) and 
further with a selection of (poly)peptides suspected to contain further interaction 
partners. Alternatively, the various baits, preys and further selection partners are 
added one after another, so that the final pool contains all baits and preys so far 
identified, in addition to the further selection partners. In other terms, in this step of 
the method of the invention, all "baits" and all "preys" are pooled and, additionally, 
further potential interaction partners are added. In this way, surprisingly the number 
of directed or indirect interactions partners of the previously identified "baits" and 
"preys" could significantly be enhanced. It is to be understood that various preys 
identified in one detection step may interact with each other and not only with, the 
baits that were employed for the identification. For example, if a collection of baits 
detects prays "a" and "b", the invention does not exclude that "a" also interacts with 
"b". The same holds true mutatis mutandis for the baits used in accordance with the 
present invention. Wherever possible, baits and preys are exchangeable in the sense 
that bait (poly)peptides may be used as preys and vice versa. In a given case, 
however, the skilled person has to determine whether or not this exchange is 
possible on the basis of unfavourable site effects and limitations of the applied 
scientific approach. This can be done by the skilled person without undue burden by 
applying standard techniques known in the art. 

It is further preferred in accordance with the present invention that the interaction of 
proteins is a specific interaction, such as a specific binding. This means that the 
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(poly)peptide being an interaction partner with a further (poly)peptide only or 
essentially only interacts with the interaction site(s) involved with this interaction 
partner. This does not exclude, of course, that further interaction sites of said 
(poly)peptide interact with further interaction partners, wherein in the corresponding 
interaction is preferably also specific. The concept also embraces that, if a 
(poly)peptide has several identical interaction sites, which in nature bind to different 
interaction partners, these different interaction partners are also bound by the 
(poly)peptide in the method of the present invention. 

In other terms, at least in the case of huntingtin, the number of interaction partners 
found in step (e) was enhanced in an exponential rather than in a linear fashion. 

The term "(poly)peptide" refers alternatively to peptide or to (polypeptides. Peptides 
conventionally are covalently linked amino acids of up to 30 residues, whereas 
polypeptides (also referred to as "proteins") comprise 31 and more amino acid 
residues. 

The term "huntingtin" refers to a protein with the data bank accession number 
P42858 which is referenced for the purpose of the present invention as "wild-type 
huntingtin. protein". However, the term "huntingtin" also comprises proteins encoded 
by the nucleic acid sequence deposited under accession number L12392 or to 
proteins encoded by nucleic acid molecules which hybridize to the nucleic acid 
molecule of L12392 under stringent conditions of hybridization. The present invention 
relates to all variants of the huntingtin protein. In particular, relevant for the present 
invention are those variants of huntingtin which comprise a polyglutamine tract 
(polyQ tnct) or an elongated polyQ tract. A polyQ tract consists of two or more 
glutamines within the huntingtin protein. The insertion of additional glutamine eodons 
will result in huntingtin proteins with, for example 2, 51, 75 or 100 added glutamines 
in comparison to the sequence deposited under accession number P42858. In fact, 
the person skilled in the art knows that the huntingtin protein may have a glutamine 
tract with any random number of glutamines in the range of 1 to 200 added 
glutamines. All these proteins are comprised by the present invention. 

The term "hybridizes under stringent conditions", as used in the description of the 
present invention, is well known to the skilled artisian and corresponds to conditions 
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of high stringency. Appropriate stringent hybridization conditions for each sequence 
may be established by a person skilled in the art on well-known parameters such as 
temperature, composition of the nucleic acid molecules, salt conditions etc.; see, for 
example, Sambrook et al., "Molecular Cloning, A Laboratory Manual"; CSH Press, 
Cold Spring Harbor, 1989 or Higgins and Hames (eds.), "Nucleic acid hybridization, a 
practical approach", IRL Press, Oxford 1985, see in particular the chapter 
"Hybridization Strategy" by Britten & Davidson, 3 to 15. Stringent hybridization 
conditions are, for example, conditions comprising overnight incubation at 42° C in a 
solution comprising: 50% formamide, 5x SSC (750 mM NaCI, 75 mM trisodium 
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardfs solution, 10% dextran 
sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0.1 x SSC at about 65°. Other stringent hybridization conditions 
are for example 0.2 x SSC (0.03 M NaCI, 0.003M Natriumcitrat, pH 7) bei 65°C. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). Note that 
variations in the above conditions may be accomplished through the inclusion and/or 
substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardfs reagent, 
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available 
proprietary formulations. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

The skilled person knows that the presence of additional codons in the nucleic acid 
sequence of huntingtin might significantly reduce the capability of this nucleic acid 
molecule to hybridize to the nucleic acid molecule deposited under L12392 and 
referenced as wild-type huntingtin protein. Nevertheless, such proteins shall still be 
comprised by. the present invention. In fact, computer programs such as the 
computer program Bestfit (Wisconsin Sequence Analysis Package, Version 8 for 
Unix, Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, Wl 53711) or blast, capable of calculating homologies between two nucleic 
acid sequences, efficiently recognize nucleotide insertions and allow for an 
adjustment of gaps created by these insertions. The term "huntingtin" as used in the 
present invention, also includes those molecules of huntingtin, which have a 
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homology of more than 95% to wild-type huntingtin when analyzed with a program 
like bestfit under conditions not weighing gaps created by polyQ tracts (gap 
penalty=0). 

The term "contacting" means bringing into contact so that two or more proteins or 
(polypeptides can interact with each other, preferably under physiological conditions. 
The terms "interacting" or "binding" refer to a transient or permanent contact between 
two proteins or (polypeptides. Preferably, the (polypeptide or protein is provided by 
expression from a nucleic acid molecule, more preferably from a cDNA molecule 
within a cDNA library. Alternatively, said nucleic acid molecule is a genomic nucleic 
acid molecule of a genomic DNA library, or a nucleic acid molecule from a synthetic 
DNA or RNA library. Preferably, the nucleic acid molecule encoding the disease- 
related protein or its interaction partner is obtainable from nerve cells, brain tissue 
human adrenal gland, human bladder, human bone, human brain, human colon, 
human dorsal root ganglion, human heart, human HeLa cells, human kidney, human 
liver, human lung, human mammary gland, human ovary, human pancreas, human 
placenta, human prostate, human retina, human salivary gland, human sceletal 
muscle, human smail intestine, human smooth muscle, human spinal cord, human 
spleen, human stomach, human testis, human thymus, human thyroid, human tonsil, 
human trachea, human uterus, human cell line HEP G2, human cell line MDA 435, 
human fetal brain, human fetal heart, human fetal kidney, human fetal liver, human 
fetal spleen, human fetal thymus, human breast tumor, human cervix tumor, human 
colon tumor, human kidney tumor, human lung tumor, human ovary tumor, human 
stomach tumor, human brain tumor and/or human uterus tumor. 

The term "disease-related protein" refers to a protein known to be the causative 
agent of a disease or known to be involved in onset or progression of a disease. 
Preferably, said disease is CHOREA HUNTINGTON or the disease-rejated protein is 
huntingtin. More preferably, the disease-related protein is selected from table 6 
and/or 7. The term "conditions that allow the interaction between interaction partners" 
means conditions that are similar to physiological conditions. Preferably, said 
conditions are physiological conditions. 

The term "selection of (polypeptides" refers to a library of (polypeptides which 
comprises the above-mentioned libraries, but also includes libraries such as phage 
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display libraries. Preferably, the (poly)peptide is provided by expression from a 
nucleic acid molecule. Preferably, the protein or (poly)peptide expressed by said 
nucleic acid molecule is a (poly)peptide comprising a DNA binding domain (DBD) (in 
this case the fusion protein is termed "bait") or (b) a (poly)peptide comprising an 
activation domain capable of interacting with a transcription factor or an RNA 
polymerase and capable of activating transcription of a reporter or indicator gene (in 
this case the fusion protein is called "prey"). As used here, the terms "reporter gene" 
and "indicator gene" are to be understood as synonyms. It is important to note that 
one of the interaction partners will always comprise the amino acid sequence of a 
protein or (poly)peptide translated from said nucleic acid molecule while the other 
interaction partner will comprise the amino acid sequence of a protein or protein 
fragment. Preferably, a bait used for a method of the present invention is selected 
from the proteins listed in table 6 and/or 7. If, for example, the proteins encoded by 
the nucleic acid molecules contain a DNA binding domain fused in frame, the fusion 
protein can bind to the DNA recognition sequence of the DNA binding domain. 
Interaction of said fusion protein with a second fusion protein containing an activation 
domain can induce transcription of a nearby indicator gene. The indicator gene may 
encode a selection marker such as a protein that confers resistance to an antibiotic 
including ampicillin, kanamycin, chloramphenicol, tetracyclic hygromycin, neomycin 
or methotrexate. Further examples of antibiotics are Penicillins: Ampicillin HCI, 
Ampicillin Na, Amoxycillin Na, Carbenicillin disodium, Penicillin G, Cephalosporins, 
Cefotaxim Na, Cefalexin HCI, Vancomycin, Cycloserine. Other examples include 
Bacteriostatic Inhibitors such as: Chloramphenicol, Erythromycin, Lincomycin, 
Tetracyclin, Spectinomycin sulfate, Clindamycin HCI, Chlortetracycline HCI. 
Additional examples are proteins that allow selection with Bacteriosidal inhibitors 
such as those affecting protein synthesis irreversibly causing cell death. 
Aminoglycosides can be inactivated by enzymes such as NPT II which 
phosphorylates 3'-OH present on kanamycin, thus inactivating this antibiotic. Some 
aminoglycoside modifying enzymes acetylate the compound and block their entry in 
to the cell. Gentamycin, Hygromycin B, Kanamycin, Neomycin, Streptomycin, G418, 
Tobramycin Nucleic Acid Metabolism Inhibitors, Rifampicin, Mitomycin C, Nalidixic 
acid, Doxorubicin HCI, 5-Flurouracil, 6-Mercaptopurine, Antimetabolites, Miconazole, 
Trimethoprim, Methotrexate, Metronidazole, Sulfametoxazole. Alternatively, said 
indicator gene may encode a protein such as lacZ, GFP or luciferase, the expression 
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of which ean be monitored by detection of a specific color. Other proteins commonly 
used as indicator proteins are beta-galactosidase, beta-glucuronidase, green 
fluorescent protein (GFP), autofluorescent proteins, including blue fluorescent protein 
(BFP), glutathione-S-transferase (GST), luciferase, horseradish peroxidase (HRP), 
and chloramphenicol acetyltransferase (CAT). In general, however, the selection in 
the yeast two hybrid-system is based on a deficiency of the yeast strain to produce 
specific amino acids. The skilled person knows that any amino acid deficiency can be 
used for this selection strategy. 

Preferably said preys and baits are expressed from two separate expression vectors 
contained in one host cell. The nucleic acid molecule encoding the preys and baits 
can be introduced into the host cell, for example, by transformation, transfection, 
transduction or microinjection which are common techniques known to the person 
skilled in the art and which require no additional explanation. In addition, the nucleic 
acid molecule contains a chromosomal or episomal nucleic acid sequence encoding 
the above-mentioned indicator protein. The expression of said indicator protein is 
under control of a recognition sequence which serves as a binding site for the bait 
protein. The nucleic acid molecule may be fused either to a DNA binding domain or 
to an activation domain. Co-expression of only those bait- and prey fusion proteins 
which are capable of interacting will induce the expression of one of the above- 
identified indicator proteins and thus allow the identification a nucleic acid molecule 
encoding a protein capable of interacting with huntingtin or an interaction or binding 
partner of huntingtin. The skilled person knows this system as the yeast two hybrid 
system. The yeast two hybrid system, which uses a bait protein-prey protein 
combination to induce transcription of the reporter gene, is a preferred method to 
identify proteins capable of interacting with huntingtin or with a direct or indirect 
interaction or binding partner of huntingtin. See for example Fields and Song, Nature 
340:245 (1989) or Uetz et al., 2000 Nature 403(6770): 623-7. This is a useful way of 
determining protein-protein interactions. Another preferred method uses the yeast 
three hybrid system, as described in U.S. Pat. No. 5,928,868. Preferably, steps (a) to 
(d) of the method for generating a network of direct and indirect interaction partners 
comprise the yeast two hybrid system. Preferably, steps (e) and (f) of the method for 
generating a network of direct and indirect interaction partners comprise yeast 
interaction mating. Preferably, said "interaction mating" comprises the interaction of 
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alT interaction partners identified in steps (a) to (d). Also preferred is that the 
interaction partners identified in steps (a) to (d) interact as prey and bait proteins, so 
that all prey proteins are contacted with all bait proteins. Using the array mating 
system, each bait is tested individually for interaction with every prey in the array. 
Alternatively, steps (e) and (f) of the method for generating a network of direct and 
indirect interaction partners comprise testing all interaction partners identified in steps 
(a) to (d) in interaction assays such as biacore or coimmunoprecipitation. When 
performing such an assay, it is preferred that the interaction partners are tested as 
prey and/or bait fusion proteins or contain no fused (poly)peptides. Preferably, all 
interaction partners are contacted in the biacore or coimmunoprecipitation assay by 
themselves and by all other remaining interaction partners identified in steps (a) to 
(d). 

The method for generating a network of direct and indirect interaction partners of a 
disease related protein or (poly)peptide has proven to be an effective tool for 
unveiling the protein-protein interactions (PPI) of preferably monogenic diseases. 
This is exemplified by the analysis of the disease related protein of Chorea 
Huntington, the analysis of which has demonstrated that the method of the present 
invention will be useful in an approach to identify potential drugs in the treatment of 
CHOREA HUNTINGTON. Moreover, this method will also be effective in unveiling 
the protein-protein interactions of other disease related proteins and in identifying 
novel targets for treatment of these diseases. Using a preferred combination of 
library and matrix yeast two-hybrid screens, based on the methods of the present 
invention, a highly connected network was generated among 70 proteins involved in 
117 proHn-protein interactions, 99 of which had not been described previously. As 
progression of Huntington's disease (HD) appears to be linked to huntingtin 
aggregation, a set of network proteins was tested for their potential to modulate this 
process. By using the methods of the present invention, it was discovered that the 
GTPase activating protein GIT1 strongly promotes huntingtin aggregation in vivo. 
GIT1 also localises to huntingtin aggregates in brains of transgenic mice and HD 
patients. Therefore, a combination of the methods of the present invention has 
proven to provide effective means for the identification of potential targets for 
therapeutic intervention. GIT1 is a selected example of a modulator interaction 
partner of huntingtin. The other proteins in the network of interaction partners 
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disclosed by the present invention are further modulator interaction partners of 
huntingtin. 

Preferably, the interaction mating comprises using an array maiting system. In 
general, for this screen, MATa yeast cultures are transformed with plasmids 
encoding prey proteins and arrayed on a microtiter plate for interaction mating with 
individual MATa strains expressing bait proteins. Using this test system, each bait 
can be tested individually for interaction with every prey in the array. Diploid yeast 
clones, formed by maiting on YPD plates and expressing both, bait and prey 
proteins, are selected on agar SDH plates, and further transferred for example by a 
spotting robot on SDIV plates to select for protein-protein interactions. In a more 
preferred embodiment of the method, plasmids encoding bait and prey proteins are 
transformed into strains L40ccua and L40cca, respectively. L40cca clones are 
arrayed on microtitre plates and mixed with a single L40ccua clone for interaction 
mating. These cells are transferred, preferably by a robot onto YPD medium plates 
and, after incubation for 20h to 28h at approximately 30°C, for selection of the cells, 
were transferred onto SDII medium plates, where mating takes place, for additional 
60h to 80h at approximately 30°C. For two-hybrid selection diploid cells are 
transferred onto SDIV medium plates with and without nylon or nitrocellulose 
membranes and incubated for approximately 5 days at about 30°C. The nylon or 
nitrocellulose membranes are subjected to the (3-GAL assay. Positive clones can be 
verified by cotransformation assays using plasmids encoding respective bait and 
prey proteins. Other preferred methods for studying protein-protein interactions 
according to the present invention are colocalization, coimmunoprecipitation, 
screening of protein or (poly)peptide arrays, library screens, in vivo and in vitro 
binding experiments using different tags such as HIS6, TAP or FLAG. 

In a preferred embodiment of the present invention's method for generating a 
network of direct and indirect interaction partners of a disease related protein or 
(poly)peptide, plasmids encoding bait proteins are transformed into a strain such as 
L40ccua, tested for the absence of reporter gene activity and co-transformed with a 
human fetal brain cDNA library. Independent transformants are plated onto minimal 
medium lacking tryptophan, leucine, histidine and uracil (SDIV medium) and 
incubated at about 30°C for 5 to 10 days. Clones are transferred into microtitre 
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plates, optionally using a picking robot, and grown over night in liquid minimal 
medium lacking tryptophan and leucine (SDII medium). Subsequently, the clones are 
spotted onto nylon or nitrocellulose membranes placed on SDIV medium plates. After 
incubation for about 4 days membranes are subjected to a p-galactosidase (p-GAL) 
assay. Plasmids are prepared from positive clones and characterised, for example by 
restriction analyses and sequencing. For retransformation assays plasmids encoding 
bait and prey proteins are cotransformed in the yeast strain L40ccua and plated onto 
SDIV medium. 

The term "generating a protein-protein interaction (PPI) network" means listing the 
interactions of all proteins interacting or binding directly or indirectly interacting the 
disease related (poly)peptide or protein. Preferably, this can be done by displaying 
the information in a matrix or a network representation. In a more preferred 
embodiment of the present invention's method, the protein-protein interaction 
network is generated by using Pivot 1.0 (Prof. Ron Shamir, Prof. Yossi Shilo, Nir 
Orlev; Tel Aviv University (TAU); Dep. of computer science; Ramat Aviv; Tel Aviv 
69978; Israel). 

In a preferred embodiment of the invention, interactions are detected by using the 
yeast two-hybrid system, MALDI-TOF MS or electro spray MS. Preferably, yeast 
strains such as strains L40ccua and L40cca, are transformed with an expression 
selected from the group consisting of pBTM116, pBTM117, pBTM117c, pACT2, 
pAS2-1, pGADIO, pGAD424, pGAD425, pGAD426, pGAD427, pGAD428. 

In another preferred embodiment of the present invention's method for generating a 
network of direct and indirect interaction partners of a disease-related polypeptide, 
the method contains after step (d) the additional steps of isolating a nucleic acid 
molecule with homology to said nucleic acid molecule expressing the encoded 
protein and testing it for its activity as a modulator of huntingtin, wherein said nucleic 
acid molecule is DNA, RNA, cDNA, or genomic DNA. Said testing can be done in 
several different assays. Preferably, the testing is performed in a co- 
immunoprecipitation assay or an affinity chromatography-based technique. 
Generally, co-immunoprecipitation is performed by purifying an interacting protein 
complex with a single antibody specific for one protein in the protein complex and by 
detecting the proteins in the protein complex. The step of detection can involve the 
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use of additional antibodies directed against proteins suspected of being trapped in 
the purified protein complex. Alternatively, at least one protein in the protein complex 
is fused to a tag sequence with affinity to a compound fixed to a solid matrix. By 
contacting the solid matrix with said tagged protein, further proteins binding to said 
protein can be purified and binding can be detected. GST or HA are preferred tags in 
accordance with the present invention. 

In a preferred embodiment of the present invention's method, said contacting step (e) 
is effected in an interaction mating two hybrid approach. 

In another preferred embodiment of the present invention's method, said method 
comprises after step (d) and before step (e) the steps of: (d') contacting 
(poly)peptides detected in step (d) with a selection of (poly)peptides suspected to 
contain one or several (polypeptides interacting with said (poly)peptides detected in 
step (d) under conditions that allow the interaction between interaction partners to 
occur; and (d") detecting proteins that interact with said (poly)peptides detected in 
step (d'). 

This preferred embodiment of the invention, an additional step of identifying further 
interaction partners is carried out prior to the contacting of all "baits" and "preys" in 
one pool (step (e)). Optionally, further steps of selecting interaction partners in 
analogy to steps (d') and (d") may be infected prior to the pooling/interaction step. 

Diseases of particular interest for which interrelationships of disease-related proteins 
may be analyzed in accordance with the invention are provided in Table 5. 

In yet another preferred embodiment of the present invention's method, said disease 
related protein is a protein suspected of being a causative agent of a hereditary (see 
Table 5), such as a monogenic disease. 

In another preferred embodiment of the present invention's method, said disease 
related protein is huntingtin and said interaction partners are the interaction partners 
as shown in table 6,7 and/or 9 
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In another preferred embodiment of the present invention's method, said method 
comprises the step of determining the nucleotide sequence of a nucleic acid 
molecule encoding a direct or indirect interaction partner of the disease related 
protein. 

In another preferred embodiment of the present invention's method, said selections 
of proteins are translated from a nucleic acid library. 

In' another preferred embodiment of the present invention's method, said selection of 
proteins in step (a) and/or (c) and/or (d') and/or (e) is the same selection or a 
selection from the same source. In another preferred embodiment of the present 
invention's method, said selection of proteins in step (a) and/or (c) and/or (d') and/or 
(e) is a different selection or a selection from a different source. 

Preferably, said source is selected from nerve cells, brain tissue, human adrenal 
gland, human bladder, human bone, human brain, human colon, human dorsal root 
ganglion, human heart, human HeLa cells, human kidney, human liver, human lung, 
human mammary gland, human ovary, human pancreas, human placenta, human 
prostate, human retina, human salivary gland, human sceletal muscle, human small 
.intestine, human smooth muscle, human spinal cord, human spleen, human 
stomach, human testis, human thymus, human thyroid, human tonsil, human trachea, 
human uterus, human cell line HEP G2, human cell line MDA 435, human fetal brain, 
human fetal heart, human fetal kidney, human fetal liver, human fetal spleen, human 
fetal thymus, human breast tumor, human cervix tumor, human colon tumor, human 
kidney tumor, human lung tumor, human ovary tumor, human stomach tumor, human 
brain tumor and/or human uterus tumor. 

In another preferred embodiment of the present invention's method, said method is 
performed by contacting the proteins on an array. Preferably, said array is an array 
allowing to detect protein-protein interaction by the principle of a biacore detector. 

In another preferred embodiment of the present invention's method, said interactions 
are detected by using the yeast two-hybrid system. Preferably, said inteactions 
detected by using MALDI-TOF, MS, electro spray MS or biacore. 
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Irfanother preferred embodiment of the present invention's method, said method 
contains after step of (b), (d), (d n ) or (f) the additional steps of isolating a nucleic acid 
molecule with homology to said cDNA expressing the encoded protein and testing it 
for its activity as a modulator of huntingtin, wherein said nucleic acid molecule is 
DNA, or RNA, and preferably cDNA, or genomic or synthetic DNA, or mRNA. 

By using the methods disclosed herein, a rate of success or fidelity of at least 70% 
validatable protein-protein interactions (PPI) (of proteins within the protein interaction 
network of huntingtin) can be achieved. This level of consistency is well above the 
level described in the art. In order to increase the rate of success or fidelity, the 
skilled person can, when carrying out the methods of the present invention, combine 
the methods of the present invention with additional steps of testing. For example, a 
step of co-immunoprecipitation and/or an in vitro binding assay may be carried out, in 
cases when initially the interaction was determined by using the yeast-two-hybrid 
system (or vice versa). Such additional steps may be carried out at any stage of the 
methods of the present invention. For example, after but also prior to step (f) of the 
method of the present invention, PPIs may be verified using in-vitro binding and/or 
immunoprecipatation assays in order to increase the stringency of the method. By 
performing these additional steps of testing, the skilled person can increase the rate 
of success or fidelity to at least 50%, more preferably to at least 60%. For the 
additional validation, any method may be employed that is available to the skilled 
artisan for testing the protein interaction. For example, the skilled artisan may simply 
repeat the step(s) initially carried out, optionally by (slightly) altering the reaction 
conditions, preferably to more stringent reaction conditions, i.e. conditions that could 
be expected to further reduce the number of false positive interactions. Alternatively, 
a different method may be carried out in the validation process. For example, if the 
method of the invention employed two hybrid systems, the validation might be carried 
out by precipitation steps as outlined elsewhere in the specification. Whereas the 
method of the invention provides valid results without the additional validation 
step(s), the inclusion of such additional validation steps may be advantageous for 
certain purposes, e.g. drug target identification. In the case that a first validation step 
does not confirm that the protein in question is a member of the interaction network, 
further steps in this regard should be carried out. For example, it should be excluded 
that the validation step(s) do/does not catch weak protein interactions that 
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nevertheless are part of the network. The present invention also relates to a nucleic 
acid molecule encoding a modulator of huntingtin, wherein said modulator is a 
protein selected from table 8. Figure 6 provides the amino acid sequences of the new 
proteins or (poly)peptides listed in table 8. The term "modulator protein of huntingtin" 
comprises two types of proteins within the network of proteins interacting with 
huntingtin. Direct interaction or binding partners of huntingtin are those proteins in the 
PPI network of huntingtin that directly interact with or bind to huntingtin (see figure 2). 
Examples of these proteins are IKAP, HYPA, CA150, HIP1, HIP11, HIP13, HIP15, 
CGI-125, PFN2, HP28, DRP-1, SH3GL3, HZFH, HIPS, PIASy, HIP16, GIT1, Ku70 
and FEZ1. Table 7 and figure 6 provides a reference allowing to identify these 
proteins. The second class of proteins are indirect interaction or binding partners of 
huntingtin, i.e. those proteins in the PPI network of huntingtin that do not directly 
interact with or bind to huntingtin. Such proteins require a mediator, i.e. a direct 
binding partner of huntingtin to exert their huntingtin modulating function. Examples 
of these proteins are BARD1 or VIM, which bind to direct interaction partners of 
huntingtin; However, complexes of huntingtin and a direct interaction or binding 
partner are likely to interact with additional indirect interaction or binding partners. To 
summarize the above, modulator proteins of huntingtin can exert their function by 
direct or indirect contact to huntingtin. 

The term "modulator protein", as used in the present invention, refers to a protein 
capable of modulating the function or physical state of a second protein and 
comprises proteins that enhance or reduce (inhibit) the function or activity of 
huntingtin. Preferably, the modulator protein is a protein having an activity selected 
from the group consisting of oxidoreductase activity (acting on the CH-OH group of 
donors, acting on the aldehyde or oxo group of donors, acting on the CH-CH group of 
donors, acting on the CH-NH(2) group of donors, acting on the CH-NH group of 
donors, acting on NADH or NADPH, acting on other nitrogenous compounds as 
donors, acting on a sulfur group of donors, acting on a heme group of donors, acting 
on diphenols and related substances as donors, acting on a peroxide as acceptor, 
acting on hydrogen as donor, acting on single donors with incorporation of molecular 
oxygen, acting on the CH-OH group of donors, acting on superoxide as acceptor, 
oxidizing metal ions, acting on -CH(2) groups, acting on iron-sulfur proteins as 
donors, acting on reduced flavodoxin as donor, acting on phosphorus or arsenic in 
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donors, acting on x-H and y-H to form an x-y bond, other oxidoreductases), 
transferase activity (transferring one-carbon groups, transferring aldehyde or ketone 
residues, acyitransferases, glycosyltransferases, transferring alky! or aryl groups, 
other than methyl groups, transferring nitrogenous groups, transferring phosphorous- 
containing groups, transferring sulfur-containing groups, transferring selenium- 
containing groups), hydrolase activity (glycosylase activity, acting on ether bonds, 
acting on peptide bonds, acting on carbon-nitrogen bonds (other than peptide 
bonds), acting on acid anhydrides, acting on carbon-carbon bonds, acting on halide 
bonds, acting on phosphorus-nitrogen bonds, acting on sulfur-nitrogen bonds, acting 
on carbon-phosphorus bonds, acting on sulfur-nitrogen bonds, acting on carbon- 
phosphorus bonds, acting on sulfur-sulfur bonds, acting on carbon-sulfur bonds, 
lyases (carbon-carbon lyases, carbon-oxygen lyases, carbon-nitrogen lyases, 
carbon-sulfur lyases, carbon-halide lyases, phosphorus-oxygen lyases, other lyases), 
isomerases (racemases and epimerases, cis-trans-isomerases, intramolecular 
oxidoreductases, intramolecular transferases, intramolecular lyases, other 
isomerases), ligases activity (forming carbon-oxygen bonds, forming carbon-sulfur 
bonds, forming carbon-nitrogen bonds, forming carbon-carbon bonds, forming 
phosphoric ester bonds), transcription factor activity, filament protein, membrane 
protein and structural protein. 

In a preferred embodiment, the present invention's nucleic acid molecule is DNA, or 
RNA, and preferably cDNA, or genomic DNA or synthetic DNA or mRNA 

In another preferred embodiment of the invention, the nucleic acid molecule is double 
stranded or single stranded. 

In another preferred embodiment of the invention, the nucleic acid molecule is of 
vertebrate, nematode, . insect, bakterium or yeast. Preferably, the nematode is 
Caenorhabditis elegans. In another more preferred embodiment of the present 
invention, the insect is drosophila, preferably drosiphila melanogaster. In another 
more preferred embodiment of the present invention, the vertebrate is human, mouse 
rat, Xenopus laevis, zebrafish. 

In yet another preferred embodiment of the present invention, the nucleic acid 
molecule is fused to a heterologous nucleic acid molecule. In a further preferred 
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embodiment of the present invention, the heterologous (polypeptide encoded by 
said heterlogous nucleic acid molecule is an immunoglobulin Fc domain. 

In another preferred embodiment of the present invention the nucleic acid molecule is 
labeled. Labeled nucleic acid molecules may be useful for purification or detection. 
Suitable labels include fluorochromes, e.g. fluorescein isothiocyanate (FITC), 
rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6- 
FAM), 2\7^Jimethoxy-4\5'-dichloro-6-caitoxyfluorescein (JOE), 6-carboxy-X- 
rhodamine(ROX), S-carboxy^'.^J'^J-hexachlorofluorescein (HEX), 5- 
carboxyfluorescein (5-FAM) or N,N,N\N^etramemyl-6-carboxyrhodamine (TAMRA), 
radioactive labels, e.g. 32 P, 35 S, 3 H; etc. The label may also be a two stage system, 
where the DNA is conjugated to biotin, haptens, etc. having a high affinity binding 
partner, e.g. avidin, specific antibodies, etc., where the binding partner is conjugated 
to a detectable label. In the case of amplification the label may be conjugated to one 
or both of the primers. The pool of nucleotides used in the amplification may also be 
labeled, so as to incorporate the label into the amplification product. Alternatively, the 
double strand formed after hybridization can be detected by anti-double strand DNA 
specific antibodies or aptamers etc. 

In a more preferred embodiment said heterologous nucleic acid molecule encodes a 
heterologous polypeptide. Preferably said heterologous (poly)peptide, fused to the 
(polypeptide encoded by the nucleic acid molecule of the present invention, is a 
DNA binding protein selected from the group consisting of GAL4 (DBP) and LexA 
(DBP). Also preferred in accordance with the present invention are activation 
domains selected from the group consisting of GAL4(AD) and VP16(AD). Also 
preferred are (polypeptides selected from the group consisting of GST, His Tag, 
Flag Tag, Tap Tag, HA Tag and Protein A Tag. 

Thus, the sequence encoding the (polypeptide may be fused to a marker sequence, 
such as a sequence encoding a peptide which facilitates purification of the fused 
(poly)peptide. In certain preferred embodiments of this aspect of the invention, the 
marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in 
a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among 
others, many of which are commercially available. As described in Gentz et al, Proc. 
Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides for 
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convenient purification of the fusion protein. The "HA" tag is another peptide useful 
for purification which corresponds to an epitope derived from the influenza 
hemagglutinin protein, which has been described by Wilson et a/., Cell 37: 767 
(1984). 

The (polypeptide may be expressed in a modified form, such as a fusion protein, 
and may include not only secretion signals, but also additional heterologous 
functional regions. For instance, a region of additional amino acids, particularly 
charged amino acids, may be added to the N-terminus of the (poly)peptide to 
improve stability and persistence in the host cell, during purification, or during 
subsequent handling and storage. Also, peptide moieties may be added to the 
(poly)peptide to facilitate purification. Such regions may be removed prior to final 
preparation of the (poly)peptide. The addition of peptide moieties to (poly)peptides to 
engender secretion or excretion, to improve stability and to facilitate purification, 
among others, are familiar and routine techniques in the art. A preferred fusion 
protein comprises a heterologous region from immunoglobulin that is useful to 
stabilize and purify proteins. 

The present invention also relates to a method of producing a vector comprising the 
nucleic acid molecule the present invention. Furthermore, the present invention 
relates to a vector produced said method. 

The present invention also relates to a vector comprising the nucleic acid molecule of 
the present invention. Preferably said vector is a transfer or expression vector 
selected from the group consisting of pACT2; pAS2-1; pBTM116; pBTM1!7; 
pcDNA3.1; pcDNAI; pECFP; pECFP-C1; pECFP-N1; pECFP-N2; pECFP-N3; 
pEYFP-C1; pFLAG-CMV-5 a, b, c; pGADIO; pGAD424; pGAD425; pGAD427; 
pGAD428; pGBT9; pGEX-3X1; pGEX-5X1; pGEX-6P1; pGFP; pQE30; pQE30N; 
PQE30-NST; pQE31; pQE31N; pQE32; pQE32N; pQE60; pSE111; pSG5; pTET- 
CMV-AS; pTET-CMV-F°-AS; pTET-CMV-F°-S; pTET-CMV-MCS; pTET-CMV-S; pTK- 
Hyg; pTL1; pTL10; pTL-HAO; pTL-HA1 ; pTL-HA2; pTL-HA3; pBTM118c; pGEX-6P3; 
pACGHLT-C; pACGHLT-A; pACGHLT-B; pUP; pcDNA3.1-V5His; pMalc2x. Said 
expression vectors may particularly be plasmids, cosmids, viruses or bacteriophages 
used conventionally in genetic engineering plasmids, cosmids, viruses and 
bacteriophages used conventionally in genetic engineering that comprise the 
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aforementioned nucleic acid. Preferably, said vector is a gene transfer or targeting 
vector. Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adeno-associated virus, herpes viruses, or bovine papilloma virus, may be used for 
delivery of the nucleic acid into targeted cell population. Methods which are well 
known to those skilled in the art can be used to construct recombinant viral vectors; 
see, for example, the techniques described in Sambrook et al., Molecular Cloning A 
Laboratory Manual, Cold Spring Harbor Laboratory (1989) N.Y. and Ausubel et al., 
Current Protocols in Molecular Biology, Green Publishing Associates and Wiley 
Interscience, N.Y. (1989). 

In yet a further preferred embodiment of the invention the vector contains an 
additional expression cassette for a reporter protein, selected from the group 
consisting of (J-galactosidase, luciferase, green fluorescent protein and variants 
thereof. 

Preferably, said vector comprises regulatory elements for expression of said nucleic 
acid molecule. Consequently, the nucleic acid of the invention may be operatively 
linked to expression control sequences allowing expression in eukaryotic cells. 
Expression of said nucleic acid molecule comprises transcription of the sequence 
nucleic acid molecule into a translatable mRNA. Regulatory elements ensuring 
expression in eukaryotic cells, preferably mammalian cells, are well known to those 
skilled in the art. They usuaily comprise regulatory sequences ensuring initiation of 
transcription and, optionally, a poly-A signal ensuring termination of transcription and 
stabilization of the transcript, and/or an intron further enhancing expression of said 
nucleic acid. Additional regulatory elements may include transcriptional as well as 
translatiunal enhancers, and/or naturally-associated or heterologous promoter 
regions. Possible regulatory elements permitting expression in eukaryotic host cells 
are the AOX1 or GAL1 promoter in yeast or the CMV-, SV40-, RSV-promoter (Rous 
sarcoma virus), CMV-enhancer, SV40-enhancer or a globin intron in mammalian and 
other animal cells. Beside elements which are responsible -for the initiation of 
transcription such regulatory elements may also comprise transcription termination 
signals, such as the SV40-poly-A site or the tk-poIy-A site, downstream or the nucleic 
acid molecule. Furthermore, depending on the expression system used leader 
sequences capable of directing the (poly)peptide to a cellular compartment or 
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secreting it into the medium may be added to the coding sequence of the 
aforementioned nucleic acid and are well known in the art. The leader sequence(s) is 
(are) assembled in appropriate phase with translation, initiation and termination 
sequences, and preferably, a leader sequence capable of directing secretion of 
translated protein, or a portion thereof, into the periplasmic space or extracellular 
medium. Optionally, the heterologous sequence can encode a fusion protein 
including an C- or N-terminal identification peptide imparting desired characteristics, 
e.g., stabilization or simplified purification of expressed recombinant product. In this 
context, suitable expression vectors are known in the art such as Okayama-Berg 
cDNA expression vector pcDVI (Pharmacia), pCDM8, pRc/CMV, pcDNM , pcDNA3, 
the EchoTM Cloning System (Invitrogen), pSPORTI (GIBCO BRL) or pRevTet- 
On/pRevTet-Off or pCI (Promega). 

The present invention also relates to a method of producing a host cell comprising 
genetically engineering cells with the nucleic acid molecule or the vector of the 
present invention. The present invention also relates to a host cell produced said 
method. Furthermore, the present invention relates to a host cell comprising the 
vector of the present invention. Preferably, said host cell contains an endogenous 
nucleic acid molecule which is operably associated with a heterologous regulatory 
control sequence, including the regulatory elements contained in the vector of the 
present invention. 

The present invention also relates to a method of producing a (poly)peptide, 
comprising culturing the host cell of the present invention under conditions such that 
the (poly)peptide encoded by said polynucleotide is expressed and recovering said 
(poly)peptide. 

The present invention also relates to a (poly)peptide comprising an amino acid 
sequence encoded by a nucleic acid molecule of the present invention, or which is 
chemically synthesized, or is obtainable from the host cell of the present invention, or 
which is obtainable by a method of the present invention or which is obtainable from 
an in vitro translation system by expressing the nucleic acid molecule of the present 
invention or the vector of the present invention. 
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In another preferred embodiment of the invention, the (poly)peptide or protein is of 
vertebrate, nematode, insect, bakterium or yeast. Preferably, the nematode is 
Caenorhabditis elegans. In another more preferred embodiment of the present 
invention, the insect is Drosophila, preferably Drosophila melanogaster. In another 
more preferred embodiment of the present invention, the vertebrate is human, mouse 
rat, Xenopus laevis, zebrafish. 

In another preferred embodiment, the (polypeptide of the present invention is fused 
to a heterologous (polypeptide. Such a fusion protein may include not only secretion 
signals, but also additional heterologous functional regions. For instance, a region of 
additional amino acids, particularly charged amino acids, may be added to the 
N-terminus of the (polypeptide to improve stability and persistence in the host cell, 
during purification, or during subsequent handling and storage. Also, peptide 
moieties may be added to the (polypeptide to facilitate purification. Such regions 
may be removed prior to final preparation of the (polypeptide. The addition of 
peptide moieties to (polypeptides to engender secretion or excretion, to improve 
stability and to facilitate purification, among others, are familiar and routine 
techniques in the art. A preferred fusion protein comprises a heterologous region 
from immunoglobulin that is useful to stabilize and purify proteins. 

In a preferred embodiment of the present invention, the (polypeptide of the present 
invention is fused to a heterologous (polypeptide which is an immunoglobulin Fc 
domain or Protein A domain. In another preferred embodiment of the present 
invention, the (polypeptide the (poly)peptide is labelled. Preferably, the label is 
selected from the group consisting of fluorochromes, e.g. fluorescein isothiocyanate 
(FITC), rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein 
(6-FAM), Z'J'-dimethoxy^'.S'-dichloro-e-carboxyfluorescein (JOE), 6-carboxy-X- 
rhodamine(ROX), B-carboxy^'.^.T'^J-hexachlorofluofescein (HEX), 5- 
carboxyfluorescein (5-FAM) or N.N.N'.N'-tetramethyl-e-carboxyrhodamine (TAMRA), 
radioactive labels, e.g. 32 P, 35 S, 3 H; etc. The label may also be a two stage system, 
where the protein or (polypeptide is conjugated to biotin, haptens, etc. having a high 
affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner 
is conjugated to a detectable label. In another preferred embodiment of the present 
invention the label is a toxin, radioisotope, or fluorescent label. 
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In another preferred embodiment of the present invention, the (poly)peptide contains 
or lacks an N-terminal methionine, it is well known in the art that the N-terminal 
methionine encoded by the translation initiation codon generally is removed with high 
efficiency from any protein after translation in all eukaryotic cells. While the 
N-terminal methionine on most proteins also is efficiently removed in most 
prokaryotes, for some proteins this prokaryotic removal process is inefficient, 
depending on the nature of the amino acid to which the N-terminal methionine is 
covalently linked. 

The present invention also relates to a protein complex comprising at least two 
proteins, wherein said at least two proteins are selected from the group of interaction 
partners listed in table 9. The term "protein complex" refers to a compound stably 
comprising at least two proteins. Preferably, said stability allows to purify said protein 
complex. In a preferred embodiment of the present invention, the protein complex 
comprises GIT1 and huntingtin. 

The present invention also relates to the protein network of huntingtin, preferably the 
physical protein entities forming this network, which is described herein. In one 
embodiment, said protein network is formed by the interaction partners shown in 
table 6. Preferable, the protein network of the present invention is a validated protein 
network as described herein. 

i The present invention also relates to an antibody specifically recognizing the 
(poly)peptide of the present invention or specifically reacting with the protein complex 
of the present invention. This antibody is characterized in not recognizing the 
individual components of the protein complex but rather the complex itself. As such, 
said antibody recognizes a combined epitope, composed of amino acids of two 

! different proteins within the protein complex. Dissociation of the complex will be 
detrimental to antibody recognition. Therefore, antibody binding depends on the 
integrity of the protein complex. In a preferred embodiment of the present invention, 
the antibody is specific for a protein complex comprising GIT1 and huntingtin. 

In a preferred embodiment, the antibody of the present invention is polyclonal, 
) monoclonal, chimeric, single chain, single chain Fv, human antibody, humanized 
antibody, or Fab fragment 
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In a more preferred embodiment of the present invention the antibody is labeled. 
Preferably, the label is selected from the group consisting of fluorochromes, e.g. 
fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, 
allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7 , -dimethoxy-4 , ,5 , -dichloro-6- 
carboxyfluorescein (JOE), 6-carboxy-X-rhodamine(ROX), 6-carboxy-2',4',TAJ- 
hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'-tetramethyl- 
6-carboxyrhodamine (TAMRA), radioactive labels, e.g. 32 P, 35 S, 3 H; etc. The label 
may also be a two stage system, where the antibody is conjugated to biotin, haptens, 
etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., where 
the binding partner is conjugated to a detectable label. In another preferred 
embodiment of the present invention the label is a toxin, radioisotope, or fluorescent 
label. 

In a preferred embodiment of the present invention, the antibody is immobilized to a 
solid support. Preferably, the solid support may be the surface of a cell, a microtiter 
plate, beads or the surface of a sensor capable of detecting binding of the antibody 
or to the antibody. 

The present invention also relates to a method of identifying whether a protein 
promotes huntingtin aggregation, comprising (a) transacting a first cell with a nucleic 
acid molecule encoding a variant of the huntingtin protein or a fragment thereof 
capable of forming huntingtin aggregates; (b) co-transfecting a second cell with (i) a 
nucleic acid molecule encoding a variant of the huntingtin protein or a fragment 
thereof capable of forming huntingtin aggregates; and (ii) a nucleic acid molecule 
encoding a candidate modulator protein identified by the methods of the present 
invention or a nucleic acid molecule encoding a modulator protein selected from table 
6 or table 7 (c) expressing the proteins encoded by the transfected nucleic acid 
molecule of (a) and (b); (d) isolating insoluble aggregates of huntingtin from the 
transfected cell of (a) and (b); and (e) determining the amount of insoluble huntingtin 
aggregates from the transfected cell of (a) and (b), wherein an increased amount of 
huntingtin aggregates isolated from the transfected cells of (b) in comparison with the 
amount of huntingtin aggregates isolated from the transfected cells of (a) is indicative 
of a protein's activity as an enhancer of huntingtin aggregation. Preferably, the 
huntingtin protein or protein fragment of step (a) is HD169Q68 or HD510Q68. 
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The present invention also relates to a method of identifying, whether a protein 
inhibits huntingtin aggregation, comprising (a) transfecting a first cell with a nucleic 
acid molecule encoding a variant of the huntingtin protein or a fragment thereof 
capable of forming huntingtin aggregates; (b) co-transfecting a second cell with (i) a 
nucleic acid molecule encoding a variant of the huntingtin protein or a fragment 
thereof capable of forming huntingtin aggregates; and (ii) a nucleic acid molecule 
encoding a candidate modulator protein identified by the methods of the present 
invention or a nucleic acid molecule encoding a modulator protein selected from table 
6 or table 7 (c) expressing the proteins encoded by the transfected nucleic acid 
molecule of (a) and (b); (d) isolating insoluble aggregates of huntingtin from the 
transfected cell of (a) and (b); and (e) determining the amount of insoluble huntingtin 
aggregates from the transfected cell of (a) and (b), wherein a reduced amount of 
huntingtin aggregates isolated from the transfected cells of (b) in comparison with the 
amount of huntingtin aggregates isolated from the transfected cells of (a) is indicative 
of a protein's activity as an inhibitor of huntingtin aggregation. Preferably, the 
huntingtin protein or protein fragment of step (a) is HD169Q68 or HD510Q68 or 
HdexQSl. 

The term "promotes" means increasing the amount of huntingtin aggregation. 

Preferably said huntingtin protein or the fragments thereof is selected from the 
proteins listed in table 6 and/or 7. Preferably said insoluble aggregates are isolated 
by using a filter retardation method comprising lysing cells and boiling in 2%SDS for 
5min in the presence of 100mM DDT followed by a filtration step. The presence of 
aggregates is detected by using specific antibodies. 

In a preferred embodiment of the present invention, determining the amount of 
insoluble huntingtin is performed . by using light scattering or size exclusion 
chromatography. In another preferred embodiment of the present invention prior to 
step (d) the cells are treated with an ionic detergent. In yet another preferred 
embodiment of the methods of the present invention, the huntingtin aggregates are 
filtered or transferred onto a membrane. 

The present invention also relates to a method for identifying compounds affecting, 
e.g. interfering or enhancing the interaction of huntingtin or of a direct or indirect 
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interaction partner of huntingtin comprising (a) contacting interacting proteins 
selected from the group of interacting proteins listed in table 6 in the presence or 
absence of a potential modulator of interaction; and (b) identifying compounds 
capable of modulating said interaction. The contacting is performed under conditions 
that permit the interaction of the two proteins. Sometimes more than two interacting 
proteins might be present in a single reaction as additional interaction partners of 
those listed under table 6, can be tested. However, the compound may also be a 
small molecule. Preferably said compounds are antibodies directed to huntingtin or to 
said interaction partner listed in table 6, wherein these antibodies are capable of 
interfering with the interaction with huntingtin. Alternatively, said compound is a 
peptide fragment of 10 to 25 amino acid residues of an interaction partner listed in 
table 7, wherein said peptide fragment is capable of interfering with the interaction 
with huntingtin. In a more preferred embodiment of the present invention, said 
antibody is an antibody directed to GIT1. In another more preferred embodiment of 
the invention, said peptide fragment is a peptide fragment of GIT1 of 10 to 25 
capable of interfering with the interaction of GIT1 with huntingtin. Said interfering 
peptide may contain additional modifications in order to increase cellular uptake, 
solubility or to increase stability. Such modifications are known to the person skilled 
in the art and need not be listed here in detail. In a preferred embodiment of the 
present invention, the methods for identifying a compound further comprise the steps 
of modeling said compound by peptidomentics and chemically synthesizing the 
modeled compound. 

In another preferred embodiment of the present invention, the methods for identifying 
a compound further comprise producing said compound. In yet another preferred 
embodiment of the present invention, the method for identifying said compound 
further comprise modifiying to achieve (i) modified site of action, spectrum of activity, 
organ specificity, and/or (ii) improved potency, and/or (iii) decreased toxicity 
(improved therapeutic index), and/or (iv) decreased side effects, and/or (v) 

modified onset of therapeutic action, duration of effect, and/or (vi) modified 
pharmakinetic parameters (resorption, distribution, metabolism and excretion), and/or 
(vii) modified physico-chemical parameters (solubility, hygroscopicity, color, taste, 
odor, stability, state), and/or (viii) improved general specificity, organ/tissue 
specificity, and/or (ix) optimized application form and route by (i) esterification of 
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carboxyl groups, or (ii) esterification of hydroxyl groups with carbon acids, or (iii) 
esterification of hydroxyl groups to, e.g. phosphates, pyrophosphates or sulfates or 
hemi succinates, or (iv) formation of pharmaceutically acceptable salts, or (v) 
formation of pharmaceutically acceptable complexes, or (vi) synthesis of 
pharmacologically active polymers, or (vii) introduction of hydrophilic moieties, or (viii) 
introduction/exchange of substituents on aromates or side chains, change of 
substituent pattern, or (ix) modification by introduction of isosteric or bioisosteric 
moieties, or (x) synthesis of homologous compounds, or (xi) introduction of branched 
side chains, or (xii) conversion of alkyl substituents to cyclic analogues, or (xiii) 
derivatisation of hydroxyl group to ketales, acetales, or (xiv) N-acetylation to amides, 
phenylcarbamates, or (xv) synthesis of Mannich bases, imines, or transformation of 
ketones or aldehydes to Schiffs bases, oximes, acetales, ketales, enolesters, 
oxazolidines, thiozolidines or combinations thereof. 

The present invention also relates to a method of diagnosing Huntington's disease in 
a biological sample comprising the steps of (a) contacting the sample with an 
antibody specific for a protein of table 6 or 7 or an antibody specific for the protein 
complex of the present invention; and (b) detecting binding of the antibody to a 
protein complex, wherein the detection of binding is indicative of Huntington's 
disease or of a predisposition to develop Huntington's disease. Preferably, binding is 
detected by measuring the presence of a fluorescent label bound to the protein 
complex. 

In a preferred embodiment of the present invention's method protein complex 
contains (a) GIT1 or (b) said antibody is specific for a protein complex containing 
GIT1. 

In a preferred embodiment of the present invention, said protein complex contains (a) 
at least one protein selected from htt, HIP15 or HP28 or (b) said antibody is specific 
for a protein complex containing at least one protein selected from htt, HIP15 or 
HP28. 

The present invention also relates to a diagnostic agent/composition comprising the 
nucleic acid molecule of the present invention, the (polypeptide of the present 
invention including/or the (polypeptide mentioned in table 6 or 7, the antibody of the 
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present invention, an antibody specifically reacting with a protein selected from table 
7and/or a protein selected from table 7. 

Moreover, the present invention also relates to a pharmaceutical composition 
comprising the nucleic acid molecule of the present invention, the (polypeptide of the 
present invention, the interfering compound identified with a method of the present 
invention, the antibody of the present invention, an antibody specifically reacting with 
a protein selected from table 7 and/or a protein selected from table 7. 

The pharmaceutical composition will be formulated and dosed in a fashion consistent 
with good medical practice, taking into account the clinical condition of the individual 
patient, the site of delivery of the pharmaceutical composition, the method of 
administration, the scheduling of administration, and other factors known to 
practitioners. The "effective amount" of the pharmaceutical composition for purposes 
herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
pharmaceutical composition administered parenterally per dose will be in the range 
of about 1 pg protein /kg/day to 10 mg protein /kg/day of patient body weight, 
although, as noted above, this will be subject to therapeutic discretion. More 
preferably, this dose is at least 0.01 mg protein /kg/day, and most preferably for 
humans between about 0.01 and 1 mg protein /kg/day for the peptide. If given 
continuously, the pharmaceutical composition is typically administered at a dose rate 
of about 1 pg/kg/hour to about 50 pg/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous 
bag solution may also be employed. The length of treatment needed to observe 
changes and the interval following treatment for responses to occur appears to vary 
depending on the desired effect. 

Pharmaceutical compositions of the invention may be administered orally, rectally, 
parenterally, intracistemally, intravaginally, intraperitoneal^, topically (as by powders, 
ointments, drops or transdermal patch), bucally, or as an oral or nasal spray. By 
"pharmaceutical^ acceptable carrier" is meant a non-toxic solid, semisolid or liquid 
filler, diluent, encapsulating material or formulation auxiliary of any type. The term 
"parenteral" as used herein refers to modes of administration which include 
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intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and 
intraarticular injection and infusion. 

The pharmaceutical composition is also suitably administered by sustained-release 
systems. Suitable examples of , sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate 
(Sidman, U. et aL, Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl 
methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 75:167-277 (1981), and R. 
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et aL, Id.) 
or poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained-release pharmaceutical 
composition also include liposomally entrapped protein, antibody, (poly)peptide, 
peptide or nucleic acid. Liposomes containing the pharmaceutical composition are 
prepared by methods known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. 
Sci. (USA) 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. (USA) 77:4030- 
4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese 
Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. 
Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar 
type in which the lipid content is greater than about 30 mol. percent cholesterol, the 
selected proportion being adjusted for the optimal therapy. 

For parenteral administration, in one embodiment, the pharmaceutical composition is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutical^ 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the formulation. 
For example, the formulation preferably does not include oxidizing agents and other 
compounds that are known to be deleterious to (poly)peptides. 

Generally, the formulations are prepared by contacting the components of the 
pharmaceutical composition uniformly and intimately with liquid carriers or finely 
divided solid carriers or both. Then, if necessary, the product is shaped into the 
desired formulation. Preferably the carrier is a parenteral carrier, more preferably a 
solution that is isotonic with the blood of the recipient. Examples of such carrier 
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vehicles include water, saline, Ringer's solution, and dextrose solution. Non-aqueous 
vehicles such as fixed oils and ethyl oleate are also useful herein, as well as 
liposomes. The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
toxic to recipients at the dosages and concentrations employed, and include buffers 
such as phosphate, citrate, succinate, acetic acid, and other organic acids or their 
salts; antioxidants such as ascorbic acid; low molecular weight (less than about ten 
residues) (polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or 
arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; 
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. The 
proteinacous components of the pharmaceutical composition are typically formulated 
in such vehicles at a concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 
mg/ml, at a pH of about 3 to 8, It will be understood that the use of certain of the 
foregoing excipients, carriers, or stabilizers will result in the formation protein or 
(polypeptide salts. 

The components of the pharmaceutical composition to be used for therapeutic 
administration must be sterile. Sterility is readily accomplished by filtration through 
sterile filtration membranes (e.g., 0.2 micron membranes). Therapeutic components 
of the pharmaceutical composition (poly)peptide compositions generally are placed 
into a container having a sterile access port, for example, an intravenous solution 
bag or vial having a stopper pierceable by a hypodermic injection needle. 

The components of the pharmaceutical composition ordinarily will be stored in unit or 
multi-dose containers, for example, sealed ampoules or vials, as an aqueous solution 
or as a lyophilized formulation for reconstitution. As an example of a lyophilized 
formulation, 10-ml vialo are filled with 5 ml of sterile-filtered 1% (w/v) aqueous protein 
solution, and the resulting mixture is lyophilized. The infusion solution is prepared by 
reconstituting the lyophilized protein using bacteriostatic Water-for-Injection. 
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The invention also provides a pharmaceutical/diagnostic pack or kit comprising one 
or more containers filled with one or more of the ingredients of the 
pharmaceutical/diagnostic compositions of the invention. Associated with such 
containers) can be a notice in the form prescribed by a governmental agency 
regulating the manufacture, use or sale of pharmaceuticals or biological products, 
which notice reflects approval by the agency of manufacture, use or sale for human 
administration. In addition, the (poly)peptides of the components of the 
pharmaceutical composition invention may be employed in conjunction with other 
therapeutic compounds. 

Finally, the present invention relates to the use of the nucleic acid molecule of the 
present invention, the interfering compound identified with a method of the present 
invention, the (poly)peptide of the present invention including/or the (poly)peptide 
mentioned in table 6 or 7, the antibody of the present invention, an antibody 
specifically reacting with a protein selected from table 7 and/or a protein selected 
from table 7 for the preparation of a pharmaceutical composition for the treatment of 
Huntington's disease. 
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Table 1: 



PROTEIN-PROTEIN INTERACTIONS IN 
THE PPI OF HUNTINGTIN j 


Baits (udu; 


rreys 


dAKUI 


Dl ID 

rLlr 




trio 


HD1.7 


OATOU 


HD1.7 


Lll Dd 


|_J H "7 

HD1.7 


UVD A 

HYrA 




on«5ijLo 


HDexQ20 


UA1 OU 


rluexu^u 


UVD A 

HYrA 


HDexQ20 


onooLo 


HDexQol 


AA-jcn 
OA10U 


HDexQol 


UVD A 

nirn 


HDexQol 


onobLo 


mp53 


poo 


mpoo 


Dl ACw 


PiASy 




PIAby 


CI IMO ^ 


\ /I IV /I 

VIM 


MCCI 

IMtrL 


VIM 


VIIVIC 


BAKU1 


DAIr 1 


dAKLJ \ 


R AIP9 


dAKUI 


RAID** 


RARD1 


FEZ1 


BARD1 


GIT1 


BARD1 


HB01 


BARD1 


HIPS 


BARD1 


HZFH 


BARD1 


IKAP 


BARD1 


mHAP1 


BARD1 


NAG4 


BARD1 


PIASy 


BARD1 


PTN 


BARD1 


SETBD1 



WO 2004/113566 



35 



PCT/EP2004/006617 



BARD1 


ZHX1 


CLH-17 


Ku70 


CLK1 


PIASy 


GADD45G 


BAIP3 


GADD45G , 


CGI-125 


GADD45G 


CGI-74 


GADD45G 


EF1A 


GADD45G 


EF1G 


GADD45G 


G45IP1 


GADD45G 


G45IP2 


GADD45G 


G45IP3 


GADD45G 


HIP16 


GADD45G 


HIP5 


GADD45G 


LUC7B1 


GADD45G 


PIASy 


GADD45G 


PLIP 


GADD45G 


PTN 


GADD45G 


PTPK 


hADA3 


BAIP1 


hADA3 


Ku70 


hADA3 


MAGEH1 


hADA3 


PIASy 


HD1.7 


CGI-125 


HD1.7 


DRP-1 


HD1.7 


FEZ1 


HD1.7 


GIT1 


HD1.7 


HIP11 


HD1.7 


HIP13 


HD1.7 


HIP15 


HD1.7 


HIP16 


HD1.7 


HIP5 


HD1.7 


HZFH 


HD1.7 


IKAP 


HD1.7 


Ku70 


HD1.7 


PIASy 


HDcM.O 


FEZ1 


HDdlO 


GIT1 


HDdlO 


IKAP 


HDd1.3 


HZFH 



WO 2004/113566 



36 



PCT/EP2004/006617 



HDd1.3 


IKAP 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HDexQ20 


CGI-125 


HDexQ20 


HIP13 


HDexQ20 


HP28 


HDexQ20 


PFN2 


HDexQ51 


CGI-125 


HDexQ51 


HIP13 


HDexQ51 


HIP15 


HDexQ51 


HP28 


HDexQ51 


PFN2 


HIP2 


PIASy 


HIP5 


APP1 


HIP5 


BAIP1 


HIP5 


BAIP2 


HIP5 


CGI-74 


HIP5 


FEZ1 


HIP5 


GIT1 


HIP5 


HB01 


HIP5 


HMP 


HIP5 


KPNA2 


HIP5 


mHAP1 


HIP5 


NAG4 


HIP5 


PLIP 


IMPD2 


PIASy 


KPNB1 


PIASy 


KPNB1 


PTN 


mp53 


HZFH 


mp53 


ZHX1 


PIASy 


MAPIc3 


TAL1 


ZHX1 


TCP1G 


Ku70 


VIM 


ALEX2 


VIM 


BAIP1 


VIM 


DRP-1 


VIM 


G45IP1 


VIM 


HB01 


VIM 


HSPC232 



WO 2004/113566 



PCT/EP2004/006617 



3 7 



VIM 


HZFH 


VIM 


PIASy 


VIM 


SETBD1 


VIM 


SH3GL3 


2NF33B 


mHAP1 


ZNF33B 


ZHX1 



Table 2 Classification of proteins in Huntington's disease interaction network 



ID 


NAME 


FUSION 


ACCESSION 


IDEN 


aa MATCH 


LOC 


.Huntingtin fragments 












"HD1.7 


huntingtin 


non 
UoU 




inn 




N C 


HDdl.O 


huntingtln 


UDLf 




•ton 


1-320 


N, C 


HDd1.3 


huntingtin 


DBD 


P42B58 


TOO 


4GCi CAR 


IN, L/ 


HdexQ20 


huntingtin 


DBD 


P42858 


96 


1-90 


N,C 


HdexQ51 


huntingtin 


DBD 


P42858 


75 


4 QO 
1-OjZ 


N,C 


Transcriptional control and DNA maintenance 












BARD1 . 


BRCA1 associated ring domain protein 1 


DBD 


Q99728 


99 


1-379 


M 

N 


CA150 


putative transcription factor CA1 50 


AD 


014776 


93 


ortQ con 

299-029 


M 

N 


GADD45G 


growth arrest and DNA damage inducible protein GADD45 gamma 


DBD 


095257 


100 


18-159 


M 


hADA3 


ADA3 like protein 


DBD 


075528 


100 


nor A DO 

235-432 


M 


HB01 


histone acetyltransferase binding to ORC 


AD 


095251 


100 


1-611 


N 


PIASy 


protein inhibitor of activated STAT protein gamma (PIASy) 


AD, DBD 


/~\0 KlOV A/n 

Q8N2W9 


100 


5-510 


IN, O 


HYPA 


huntingtin Interacting protein HYPA/FBP11 (fragment) 


AD 


075400 


•inn 
100 


a Ann 


O, IN 


HZFH 


zinc finger helicase HZFH 


AD, DBD 


Q9Y4I0 


100 


1830-2000 


N 


IKAP 


IKK complex associated protein 


AD 


095163 


100 


1207-1332 


N,C 


Ku70 


ATP dependent DNA helicase H, 70 kDa subunit 


AD 


P12956 


100 


298^608 


N 


NAG4 


bromodomain containing protein NAG4 


AD 


Q9NPI1 


100 


94-651 


N 


'p53 


cellular tumor antigen p53 


AD 


P04637 


100 


1-393 


N 


p53c 


cellular tumor antigen p53 (C-terminus) 


AD 


P04637 


100 


248-393 


N 


mp53 


cellular tumor antigen p53 (mouse) 


DBD 


P02340 


100 


73-390 


N 


PLIP 


c?LA°. interacting protein 


AD 


095624 


100 


5-461 


N, PN 


SETDB1 


histone-Jysine N-methyitransferase, H3 lysine-9 specific 4 


AD 


Q15047 


100 


1023-1291 


N 


SUMO-2 


ubiqultin like protein SMT3A (SUMO-2) 


AD 


P55854 


100 


1-103 


C,N 


SUMO-3 


ubiquitin like protein SMT3B (SUMO-3) 


AD 


P55855 


100 


1-95 


C,N 


ZHX1 


zinc finger homeobox protein ZHX1 


AD 


Q9UKY1 


100 


145-873 


N 


ZNF33B 


zinc finger protein 33b 


DBD 


Q8NDW3 


100 


527-778 


N 


Cellular organization and protein transport 












APP1 


amyloid like protein 1 precursor 


AD 


P51693 


100 


243-555 


PM, EC 


CLH-17 


clathrin heavy chain 1 


DBD 


Q00610 


100 


1-289 


PM.V 


HP28 


axonemal dynein light chain (hp28) 


AD 


Q9BQZ6 


100 


3-258 


CN 


mHAP1 


huntingtin associated protein 1 (mouse) 


AD 


035668 


100 


3-471 


C. EE 


HIP1 - 


huntingtin interacting protein 1 


AD 


000291 


100 


**5-631 


C, GN 


HMP 


mitofilln 


AD 


Q16891 


100 


212-758 


Mit 


MAP1lc3 


microtubule associated proteins 1A/1B light chain 3 


AD 


Q9H491 


100 


58-170 


CN, MT 


NEFL 


light molecular weight neurofilament protein 


AD 


Q8IU72 


100 


1-543 


CN, IF 



WO 2004/113566 



PCT/EP2004/006617 



38 



PFN2 


profilln II 


AD 


P35Q80 


100 


1-140 


CN 


PTN 


plelotrophln precursor (exon 1 1ncluded) 


AD 


P21246 


100 


1-168 


PM, EC 


SH3GL3 


SH3 containing GRB2 like protein 3 


AD 


Q99963 


100 


3-347 


V 


KPNA2 


karyopherin alpha-2 subunit 


AD 


P52292 


100 


141-529 


C, N 


KPNB1 


karyopherin beta-1 subunit' 


DBD 


Q14974 


100 


CCO 07C 


C, N 


VIM 


vimentin 


DBD 


P08670 


100 


1-466 


CN, IF 


VIMc 


vimentln (C-terminus) 


AD 


P08670 


100 


190-466 


CN, IF 


Cell signaling and fate 












ALEX2 


armadillo repeat protein ALEX2 


AD 


060267 


100 


127-632 


C, PM 


CLK1 


protein kinase CLK1 


DBD 


P49759 


100 


209-484 


M 

N 


FEZ1 


fasciculation and elongation protein zeta 1 


AD 


Qb9oo9 


100 


«fOi *aoo 




GIT1 


ARF GTPaso activating protein GIT1 


An 

AU 


Wi? TZA/ 


Qfi 


9AQ-7fi1 

C+a- 1 \J I 


PM, V 


PTPK 


protein-tvrosine phosphatase kappa precursor 


AD 


Q 15262 


100 


1227-1439 


PM, AJ 


Cellular metabolism 












DRP-1 


dihydropyrimidinase related protein 1 (C-terminus) 


AD 


Q14194 


100 


345-572 


C 


IMPD2 


inosine-S'-monophosphate dehydrogenase 2 


DBD 


P12268 


100 


34-514 


C 


kTAL1 


transaldolase 


DBD 


P37837 


100 


3-337 


C 


^Protein synthesis and turnover 












EF1A 


translation elongation factor 1 alpha 1 


An 




inn 


294-462 


C, MT 


EF1G 


elongation factor 1 gamma 


An nRn 
f\U, uou 




inn 

IUU 




C MT 


EF1Gc 


elongation factor 1 gamma (C-terminus) 


An 




•inn 

i uu 




C MT 


HIP2 


ubiqultin conjugating enzyme E2-25 kDa 


DBD 


P27924 


100 


1-200 


C t N 


TCPG 


T-complex protein 1 , gamma subunit 






100 


252-544 


c 


Uncharacterized proteins 












BAIP1 


BARD1 interacting protein 1 [similar to RIKEN cDNA 1810018M11] 


An 


UoDoOu 


inn 

1 uu 


1-226 


UN 


BAIP2 


. BARD1 interacting protein 2 [hypothetical protein] 


An 

r\U 




100 


107-684 


UN 


BAIP3 


BARD1 interacting protein 3 [hypothetical protein] 


An 

AU 




100 


152-436 


UN 


CGI-74 


CGI-74 protein 


An 


uy i ooo 


*inn 

IUU 


1 59-270 


• UN 


CGI-125 


CGI-125 protein 


An 

AU 


Wo T Ovs 1 


inn 


i- 1 0 \ 


UM 

WIN 


G45IP1 


GADD45G Interacting protein 1 [hypothetical protein] 


AD 


AflUn\/7 

Q9nOV7 


•inn 
Tuu 






G45IP2 


GADD45G interacting protein 2 [B2 gene partial cDNA, clone B2E] 


An 


nQfjYAn 

vJSiN t rVJ 


inn 

IUU 




UN 


G45IP3 


GADD45G interacting protein 3 [OK/SW-CL16] 


AD 


DM IT/1 


TUu 




1 IM 

UlN 


H1P5 


huntingtin interacting protein 5 [hypothetical protein KIAA1377] 


AD, DBD 


Q9P2H0 


100 


445-988 


N,C 


, HIP11 


huntingtin interacting protein 11 [hypothetical protein] 


AD 


Q96E29 


100 


176-328 


UN 


' HIP13 


huntingtin interacting protein 13 [metastasis suppressor protein] 


AD 


Q96RX2 


100 


512-755 


UN 


HIP15 


huntingtin interacting protein 15 [similar to KIAA0443 gene product] 


AD 


Q96D09 


100 


663-838 


UN 


HIP16 


huntingtin interacting protein 16 [similar to KIAA0266 gene product] 


AD 


Q9BVJ6 


100 


585-771 


UN 


HSPC232 


HSPC232 


AD 


Q9P0P6 


92 


1-319 


UN 


LUC7B1 


putative SR protein LUC7B1 (SR+89) 


AD 


Q9NQ29 


99 


116-371 


ER 


MAGEH1 


melanoma associated antigen H1 


AD 


Q9H213 


100 


1-219 


UN 



Abbreviations: aa, amino acids; IDEN, Identity; LOC, localisation; AD, activation domain; DBD, DNA binding domain; AJ, 
adherens junctions; C, cytosol; CN, cytoskeleton; EC, extracellular space; EE, early endosomes; ER, endoplasmic reticulum; IF, 
intermediate filaments; GN, Golgi network; Wilt, mitochondria; WIT, microtubules; N, nucleus; PM, plasma membrane; PN, 
perinuclear; UN, unknown; V, vesicles; [ ], database annotation 
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Table 3 New proteins in Huntington's disease interaction network 

jO NAME FUSION ACCESSION IDEM aa MATCH LOC 

Transcriptional control and DMA maintenance 

BARD1 BRCA1 associated ring domain protein 1 DBD Q99728 99 1-379 N 

CA150 putative transcription factor CA1 50 AD 014776 93 299-629 N 



Cell signaling and fate 

GIT1 ARF GTPase activating protein GIT1 
HSPC232 HSPC232 

LUC7B1 putative SR protein LUC7B1 (SR+89) 



AD Q9Y2X7 98 249-761 PM.V 
AD Q9P0P6 92 1-319 UN 

AD Q9NQ29 99 116-371 ER 



Abbreviations: aa, amino acids; IDEN, identity; LOC, localisation; AD f activation domain; DBD, DNA binding domain; AJ, 
aSharens junctions; C, cytosol; CN, cytoskeleton; EC, extracellular space; EE, early endosomes; ER, endoplasmic reticulum; IF, 
intermediate filaments; GN, Golgi network; Mit, mitochondria; MT, microtubules; N, nucleus; PM, plasma membrane; PN, 
perinuclear; UN, unknown; V, vesicles; [ ], database annotation 



Table 4: 



New protein-protein interactions, found 


Baits (DBD) 


Preys (AD) 


BARD1 


BA1P1 


BARD1 


BAIP2 


BARD1 


BAiP3 


BARD1 


FEZ1 


BARD1 


GIT1 


BARD1 


HB01 


BARD1 


HIPS 


BARD1 


HZFH 


BARD1 . 


IKAP 


BARD1 


mHAP1 


BARD1 


NAG4 


BARD1 


PiASy 


BARD1 


PTN 


BARD1 


SETBD1 


BARD1 


ZHX1 


CLH-17 


Ku70 


CLK1 


PIASy 


GADD45G . 


BAIP3 


GADD45G 


CG1-125 


GADD45G 


CGI-74 
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GADD45G 


EF1A 


GADD45G 


EF1G 


GADD45G 


G45IP1 


GADD45G 


G45IP2 


GADD45G 


G45IP3 


GADD45G 


HIP16 


GADD45G 


HIP5 


GADD45G 


LUC7B1 


GADD45G 


PIASy 


GADD45G 


PLIP 


GADD45G 


PTN 


GADD45G 


PTPK 


hADA3 


BAIP1 


hADA3 


Ku70 


hADA3 


MAGEH1 


hADA3 


PIASy 


HD1.7 


CGI-125 


HD1.7 


DRP-1 


HD1.7 


FEZ1 


HD1.7 


GIT1 


HD1.7 


HIP11 


HD1.7 


HIP13 


HD1.7 


HIP15 


HD1.7 


HIP16 


HD1.7 


HIP5 


HD1.7 


HZFH 


HD1.7 


IKAP 


HD1.7 


Ku70 


HD1.7 


PIASy 


HDcM.O 


FEZ1 


HDdl.O 


GIT1 


HDdlO 


IKAP 


HDd1.3 


HZFH 


HDd1.3 


IKAP 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HDexQ20 


CGI-125 


HDexQ20 


HIP13 


HDexQ20 


HP28 
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HDexQ20 


PFN2 


HDexQ51 


CGI-125 


HDexQ51 


HIP13 


HDexQ51 


HIP15 


HDexQ51 


HP28 


HDexQ51 


PFN2 


HIP2 


PIASy 


HIP5 


APP1 


HIP5 


BAIP1 


HIP5 


BAIP2 


HIP5 


CGI-74 


HIP5 


FEZ1 


HIP5 


GIT1 


HIP5 


HB01 


HIP5 


HMP 


HIP5 i 


KPNA2 


HIP5 


mHAP1 


HIP5 


NAG4 


HIP5 


PLIP 


IMPD2 


PIASy 


KPNB1 


PIASy 


KPNB1 


PTN 


mp53 


HZFH 


mp53 


ZHX1 


PIASy 


MAPIc3 


TAL1 


ZHX1 


TCP1G 


Ku70 


VIM 


ALEX2 


VIM 


BAIP1 


VIM 


DRP-1 


VIM 


G45IP1 


VIM 


HB01 


VIM 


HSPC232 


VIM 


HZFH 


VIM 


PIASy 


VIM 


SETBD1 


VIM 


SH3GL3 


ZNF33B 


mHAP1 


ZNF33B 


ZHX1 
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Table 5: 

* Aarskog syndrome 

* Achromatopsia 

* Acoustic neuroma 

* Adrenal hyperplasia 

* Adrenoleukodystrophy 

* Agenesis of corpus callosum 

* Aicardi syndrome 

* Alagille syndrome 

* Albinism 

* Alopecia areata 

* Alstrom syndrome 

* Alpha-1 -antitrypsin deficiency 

* Alzheimer 

* Ambiguous genitalia 

* Androgen insensitivity syndrome(s) 

* Anorchia 

* Angelman syndrome 

* Anopthalmia 

* Apert syndrome 

* Arthrogryposis 

* Ataxia 

* Autism 

* Bardet-Biedl syndrome 

* Basal cell carcinoma 

* Batten disease 

* Beckwith-Wiedemann syndrome 

* Blepharophimosis 

* Blind 

* Branchio-Oto-Renal (BOR) syndrome 

* Canavan 

* Cancer: (ataxia telangiectasia, basal cell nevus, brain /spine, breast, colon / bowel, 
leukemia / lymphoma, lung, melanoma / skin, multiple endocrine neoplasia, oral, 
ovarian, prostate, retinoblastoma, testicular, von Hippel-Lindau, xeroderma 
pigmentosa) 

* Cardiof-ciocutaneous syndrome 

* Celiac sprue 

* Charcot-Marie-Tooth 

* CHARGE association 

* Chromosome anomalies - trisomy, deletions, inversions, duplications, 
translocations, 4p- (Wolf-Hirshhorn), 5 (cri-du-chat, 5p-), 6, 8p, 9 (trisomy 9, 9p-), 
11 (11q, 11:22), 13 (trisomy 13, Patau), 15, 16 (mosaic), 18 (18q-, 18p-, ring 18, 
trisomy 18, tetrasomy 18p, Edwards), 21 (Down syndrome, trisomy 21), 22, X & Y 
[sex chromosome anomalies, Klinefelter (XXY, other), Turner (XO, other), fragile-X, 
other] 

* Cleft Hp and/or cleft palate 

* Cockayne syndrome 

* Coffin-Lowry syndrome 

* Coffin-Siris syndrome 

* Congenital heart defects 
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* Connective tissue conditions 

* Cooley anemia 

* Conjoined twins 

* Cornelia de Lange syndrome 

* Costello syndrome • 

* Craniofacial conditions 

* Cri-du-Chat (5p-) 

* Cystic fibrosis 

* Cystinosis 

* Cystinuria 

* Dandy-Walker syndrome 

* Deaf / hard of hearing 

* Dermatological (skin) conditions 

* Developmental delay / mental retardation 

* DiGeorge syndrome 

* Down syndrome 

* DRPLA 

* Dubowitz syndrome 

* Dwarfism/ short stature 

* Dysautonomia 

* Dystonia 

* Ectodermal dysplasia 

* Ehlers Danlos syndrome 

* Endocrine Conditions 

* Epidermolysis bullosa 

* Facial anomalies, disfigurement 

* Fanconi anemia 

* Fetal alcohol syndrome and effects 

* FG syndrome 

* Fragile-X syndrome 

* Friedreich ataxia 

* Freeman Sheldon syndrome 

* Galactosemia 

* Gardner syndrome 

* Gastroenterology conditions 

* Gaucher disease 

* Glycogen storage disease 

* Goldenhar syndrome 

* Gorlin syndrome 

* Hallerman Streiff syndrome 

* Hearing problems 

* Heart conditions 

* Hemochromatosis 

* Hemophilia 

* Hemoglobinopathies 

* Hereditary hemorrhagic telangiectasia 

* Hereditary spastic paraplegia 

* Hermansky-Pudlak syndrome 

* Hirschsprung anomaly 

* Holoprosencephaly 
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* Huntington disease 

* Hydrocephalus 

* Ichthyosis 

* Immune deficiencies 

* Incontinentia pigmenti 

* Infertility 

* Intestinal problems 

* Joseph disease 

* Joubert syndrome 

* Kabuki syndrome 

* Kidney conditions 

* Klinefelter syndrome 

* Klippel-Feil syndrome 

* Klippel-Trenaunay syndrome 

* Langer-Giedion syndrome 

* Laurence-Moon-Biedl syndrome 

* Leber Optic Atrophy 

* Leigh disease 

* Lesch-Nyhan syndrome 

* Leukodystrophy [Adrenoleukodystrophy (ALD), Alexanders Disease, CADASIL 
(Cerebral Autosomal Dominant Adenopathy with Subcortical Infarcts & 

. Leukoencephalopathy), Canavan Disease (Spongy Degeneration), 
Cerebrotendinous Xanthomatosis (CTX), Globoid Cell (Krabbes) Leukodystrophy, 
Metachromatic Leukodystrophy (MLD), Ovarioleukodystrophy , Pelizaeus- 
Merzbacher Disease, Refsum Disease, van der Knaap syndrome, Zellweger 
syndrome] 

* Limb anomalies [missing arm(s) or leg(s), Poland anomaly, other] 

* Lissencephaly [Isolated Sequence (ILS), X-Linked (XLIS), Subcortical Band 
Heterotopia (SBH), Miller-Dieker syndrome (MDS), Microcephaly, 
Microlissencephaly (MLIS), Norman-Roberts syndrome (NRS), With Cerebellar 
Hypoplasia (LCH), Polymicrogyria (PMG), Schizencephaly (SCH), Muscle-Eye- 
Brain (MEB) Disease, and Walker-Warburg syndrome (WWS), 17p13.3 deletion] 

* Liver conditions (biliary atresia, Alagille syndrome, alpha-1 antitrypsin, tyrosinemia, 
neonatal hepatitis, Wilson disease) 

* Lowe syndrome 

* Lung / pulmonary conditions 

* Lymphedema 

* Maffucci syndrome(Ollier, multiple cartilaginous enchondromatosis) 

* Malignant hyperthermia 

* Maple syrup urine disease 

* Marinesco-Sjogren Syndrome 

* Marfan syndrome 

* Menke syndrome 

* Mental retardation / developmental delay 

* Metabolic conditions (carbohydrate deficient glycoprotein syndrome (CDGS), 
diabetes insipidus, Fabry, galactosemia, glucose-6-phosphate dehydrogenase 
(G6PD), fatty acid oxidation disorders, glutaric aciduria, hypophosphatemia, 
Krabbe, lactic acidosis, lysosomal storage diseases, mannosidosis, maple syrup 
urine, mitochondrial, neuro-metabolic, organic acidemias, PKU, purine, pyruvate 
dehydrogenase deficiency, urea cycle conditions, vitamin D deficient rickets) 
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* Miscarriage, stillbirth, infant death 

* Mitochondrial conditions (Alpers, Barth, beta-oxidation defects, carnitine deficiency, 
CPEO, Keams-Sayre, lactic acidosis, Leber optic neuropathy, Leigh, LCAD, Luft, 
MCAD, MAD, glutaric aciduria, MERRF, MNGIE, NARP, Pearson, PHD, SCAD, 
NADH-CoQ reductase, succinate dehydrogenase, Complex III, Complex IV, COX, 
Complex V, other) 

* Moebius syndrome 

* Mucolipidosis, type IV (ML4) 

* Mucopolysaccharidosis (Hunter syndrome, Hurler syndrome, Maroteaux-Lamy 
syndrome, Sanfilippo syndrome, Scheie syndrome, Morquio syndrome, other) 

* Multiple hereditary exostoses 

* Muscular dystrophy /atrophy (neuromuscular conditions including: Duchenne, 
facioscapulohumeral, Charcot Marie Tooth, spinal muscular atrophy, other) 

* Myotonic dystrophy 

* Nager & Miller syndromes 

* Nail Patella syndrome 

* Narcolepsy 

* Neurologic conditions (neuro-metabolic, neurogenetics, neuromuscular, other) 

* Neurofibromatosis (von Recklinghausen) 

* Neuromuscular conditions 

* Niemann-Pick disease 

* Noonan syndrome 

* Opitz syndromes [Opitz-Frias, Opitz FG (Opitz-Kaveggia), Opitz-C 
(Trigonocephaly)] 

* Organic acidemias 

* Osler-Weber-Rendu syndrome 

* Osteogenesis imperfecta 

* Oxalosis & hyperoxaluria 

* Pallister-Hall syndrome 

* Pallister-Kiilian syndrome (tetrasomy 12p, Teschler-Nicola syndrome) 

* Parkinsons disease 

* Periodic paralysis 

* Phenylketonuria (PKU) 

* Polycystic kidney disease 

* Popliteal pterygium syndrome 

* Porphyria 

* Prader-Willi syndrome 

* Progeria (Werner, Hutchinson-Gilford, Cockayne, Rothmond-Thomson syndromes) 

* Proteus syndrome 

* Prune belly syndrome 

* Pseudoxanthoma elasticum (PXE) 

* Psychiatric conditions 

* Refsum disease 

* Retinal degeneration 

* Retinitis pigmentosa (retinal degenerative diseases, Usher syndrome) 

* Retinoblastoma 

* Rett syndrome 

* Robinow syndrome 

* Rubinstein-Taybi syndrome 

* Russell-Silver syndrome 
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* SBMA 
*SCA 

* Schizencephaly 

* Sex chromosome anomalies (47,XXY, 47,XXX, 45,X and variants, 47.XYY) 

* Shwachman syndrome 

* Sickle cell anemia 

* Skeletal dysplasia 

* Smith-Lemli-Opitz syndrome (RHS syndrome) 

* Smith-Magenis syndrome (17p-) 

* Sotos syndrome 

* Spina bifida (myelomeningocele, neural tube defects) 

* Spinal muscular atrophy (Werdnig-Hoffman, Kugelberg-Welander) 

* Stickler / Marshall syndrome 

* Sturge-Weber 

* Tay-Sachs disease / other (dysautonomia, dystonia, Gaucher, Niemann Pick, 
Canavan, Bloom) 

* Thalassemia (Cooley anemia) 

* Thrombocytopenia absent radius syndrome 

* Tourette syndrome 

* Treacher Collins syndrome (craniofacial) 

* Trisomy (21, 18, 13, 9, other, see chromosome syndromes) 

* Tuberous sclerosis 

* Turner syndrome 

* Twins / triplets / multiple births 

* Unknown disorders 

* Urea cycle conditions 

* Usher syndrome 

* VATER association 

* Velo-cardio-facial syndrome (Shprintzen, DiGeorge, 22q deletion) 

* Visual impairment / blind 

* Von Hippel-Lindau syndrome 

* Waardenburg syndrome 

* Weaver syndrome 

* Werner syndrome 

* Williams syndrome 

* Wilson disease (hepatolenticular degeneration) 

* Xeroderma pigmentosum 

* Zellweger syndrome 
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TABLE 6 

PROTEIN-PROTEIN INTERACTIONS IN THE PROTEIN NETWORK OF HUNTINGTIN 



BAIT 


PREY 


SETDB1 


SUMO-3 


PIASy 


SUMO-3 


HZFH 


SUMO-3 


PIASy 


HYPA 


HZFH 


HYPA 


MAPI Ic3 


HYPA 


ZHX1 


HYPA 


PIASy 


HZFH 


HZFH 


HZFH 


GIT1 


HZFH 


VIM 


HZFH 


PIASy 


ZHX1 


HZFH 


ZHX1 


VIM 


ZHX1 


FEZ1 


HMP 


HZFH 


HMP 


HMP 


HMP 


PIASy 


HMP 


HZFH 


PTN 


HIP15 


PTN 


PIASy 


PTN 


PTN 


PTN 


FEZ1 


PTN 


KPNA2 


G45IP3 


GIT1 


G45IP3 


BAI n 1 


G45IP3 


FEZ1 


G45IP3 


SH3GL3 


G45IP3 


EF1A 


APP1 


SETDB1 


APP1. 


HIP16 


APP1 


GDF9 


APP1 


G45IP1 


APP1 


BAIP1 


APP1 


HIP5 


BAIP3 


GIT1 


BAIP3 


BAIP2 


BAIP3 


APP1 


BAIP3 
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FEZ1 


BAIP3 


NAG4 


BAIP3 


SETDB1 


BAIP3 


HB01 


BAIP3 


HIP15 


BAIP3 


BAIP3 


BA1P3 


HZFH 


BAIP3 


PLIP 


BAIP3 


mHAP1 


BAIP3 


PIASy 


BAIP3 


HMP 


BAIP3 


NAG4 


NEFL 


HZFH 


NEFL 


VIM 


NEFL 


PIASy 


NEFL 


HMP 


HIP5 


PLIP 


HIP5 


mHAP1 


HIP5 


HB01 


HIP5 


KPNA2 


HIP5 


VIM 


HIP5 


APP1 


HIP5 


HIP15 


HIP5 


NAG4 


HIP5 


GIT1 


HIP5 


BAIP1 


. HIP5 


FEZ1 


HIP5 


CGI-74 


HIP5 


BAIP2 


HIP5 


ALEX2 


ALEX2 


PIASy ' 


MAG EH 1 


KPNA2 


MAGEH1 


SETDB1 


CA150 


LUC7B1 


CA150 


HZFH 


CA150 


PIASy 


CA150 


PIASy 


hADA3 


BAIP1 


hADA3 


MAGEH1 ■ 


hADA3 


Ku70 


hADA3 


GIT1 


BARD1 
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BAIP3 


BARD1 


SETDB1 


BARD1 


CA150 


BARD1 


NAG4 


BARD1 


HIP15 


BARD1 


HIP5 


BARD1 


PTN 


BARD1 


FEZ1 


BARD1 


IKAP 


BARD1 


BAIP1 


BARD1 


mHAP1 


BARD1 


HB01 


BARD1 


BAIP2 


BARD1 


PLIP 


BARD1 


PIASy 


BARD1 


HZFH 


BARD1 


ZHX1 


BARD1 


SH3GL3 


HDexQ20 


HIP13 


HDexQ20 


CGI-1 25 


. HDexQ20, 


PFN2 


HDexQ20 


CA150 


HDexQ20 


HYPA 


HDexQ20 


HP28 


HDexQ51 


HYPA 


HDexQ51 


CA150 


HDexQ51 


SH3GL3 


HDexQ51 


HIP13 


HDexQ51 


HIP15 


HDexQ51 


PFN2 


HDexQ51 


CG1-125 


HDexQ51 


LUC7B1 


GADD45G 


GDF9 


.. GADD45G 


PTN 


GADD45G 


BAIP3 


GADD45G 


G45IP2 


GADD45G 


HIP16 


GADD45G 


G45IP3 


GADD45G 


CGI-125 


GADD45G 


G45IP1 


GADD45G 


HIPS 


GADD45G 
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EF1G 

EF1A 

PLIP 

PIASy 

CGI-74 

PTPK 

MAPI Ic3 

SUMO-2 

SUMO-3 

HYPA 

HIP16 

DRP-1 

HZFH 

SH3GL3 

HIP13 

CGI-125 

CA150 

HIP11 

Ku70 

HIP1 



GADD45G 

GADD45G 

GADD45G 

GADD45G 

GADD45G 

GADD45G 

PIASy 

PIASy 

PIASy 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 

HD1.7 



IKAP 


HD1.7 


PFN2 


HD1.7 


FEZ1 


• HD1.7 


GIT1 


HD1.7 


HIP5 


HD1.7 


PIASy 


HD1.7 


GIT1 


HDdl.O 


IKAP 


HDdl.O 


FEZ1 


HDdl.O 


PIASy 


HDd1.3 


IKAP 


HDd1.3 


HZFH 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HIP2 


Ku70 


CLH-17 


HZFH 


mp53 


ZHX1. 


mp53 


p53 


mp53 


PIASy 


mp53 


PLIP 


GAPD 
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PIASy 


IMPD2 


EF1G 


EF1G 


HIP11 


EF1G 


HZFH 


TAL1 


ZHX1 


TAL1 


Ku70 


TCPG 


PIASy 


CLK1 


mHAP1 


ZNF33B 


ZHX1 


ZNF33B 


HZFH 


KPNB1 


PIASy 


KPNB1 


PTN 


KPNB1 


ALEX2 


VIM 


SH3GL3 


VIM 


PIASy 


VIM 


HIP16 


VIM 


HB01 


VIM 


BA1P1 


VIM . 


DRP-1 


VIM 


G45IP1 


VIM 


MOV34 


VIM 


\/IM 


VIM 

V llvl 


NEFL 


VIM 


HSPC232 


VIM 


SETDB1 


VIM 


HIP15 


HD1.7 


HP28 


HDexQ20 
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Table 8 New proteins in Huntington's disease interaction network 


ID NAME 


FUSION 


ACCESSION 


PEN 


aa MATCH 


LOG 


TMncnrinfinnal pnniml anH PIMA maintonanra 

transcriptional coninji ctiiu uiw iiiainisnancc 












BARD1 BRCA1 associated ring domain protein 1 


DBD 


Q99728 


99 


1-379 


N 


^*A"i^n j-si ifatn/Q franQprintinn factor f^A1 ^(\ 


AD 


014776 


93 


299-629 


N 


Protein synthesis and turnover 












MOV34 MOV34 isolog 


AD, DBD 


015387 


95 


1-297 


C,N 


Ceil signaling and fate 












GIT1 ARF GTPase activating protein GIT1 


AD 


Q9Y2X7 


98 


249-761 


PM.V 


HSPC232 HSPC232 


AD 


Q9P0P6 


92 


1-319 


UN 


LUC7B1 putative SR protein LUC7B1 (SR+89) 


AD 


Q9NQ29 


99 


116-371 


ER 



Abbreviations: aa, amino acids; IDEN, identity; LOC, localisation; AD, activation domain; DBD, DNA binding domain; AJ, 
.adherens junctions; C, cytosol; CN, cytoskeleton; EC, extracellular space; EE, early endosomes; ER, endoplasmic reticulum; IF, 
intermediate filaments; GN, Golgl network; Mit, mitochondria; WIT, microtubules; N, nucleus; PM, plasma membrane; PN, 
perinuclear; UN, unknown; V, vesicles; [ ], database annotation 
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Table 9: New protein-protein interactions found 



BAIT 


PREY 


SETDB1 


SUMO-3 


PIASy 


SUMO-3 


HZFH 


SUMO-3 


PIASy 


HYPA 


HZFH 


HYPA 


MAP1lc3 


HYPA 


ZHX1 


HYPA. 


PIASy 


HZFH 


HZFH 


HZFH 


GIT1 


HZFH 


VIM 


HZFH 


PIASy 


ZHX1 


HZFH 


ZHX1 


VIM 


ZHX1 


FEZ1 


HMP 


HZFH 


HMP 


HMP 


HMP 


PIASy 


HMP 


HZFH 


PTN 


HIP15 


PTN 


PIASy 


PTN 


PTN. 


PTN 


FEZ1 


PTN 


KPNA2 


G45IP3 


GIT1 


' G45IP3 


BAIP1 


G45IP3 


FEZ1 


G45IP3 


SH3GL3 


G45IP3 


EF1A 


APP1 


SETDB1 


APP1 


HIP16 


APP1 


GDF9 


APP1 


G45IP1 


APP1 


BAIP1 


APP1 


HIP5 


BAIP3 


GIT1 


BAIP3 


BAIP2 


BAIP3 


APP1 


BAIP3 


FEZ1 


BAIP3 



WO 2004/113566 



PCT/EP2004/006617 



NAG4 


BAIP3 


SETDB1 


BAIP3 


HB01 


BAIP3 


HIP15 


BAIP3 


BAIP3 


BAIP3 


HZFH 


BAIP3 


PUP 


BAIP3 


mHAP1 


BAIP3 


PIASy 


BA1P3 


HMP 


BAIP3 


NAG4 


NEFL 


HZFH 


NEFL 


VIM 


NEFL 


PIASy 


NEFL 


HMP 


HIP5 


PUP 


HIP5 


mHAP1 


HIP5 


HB01 


HIP5 


KPNA2 


HIP5 


VIM 


HIP5 


APP1 


HIP5 


HIP15 


HIP5 


NAG4 


HIP5 


GIT1 


HIP5 


BAIP1 


HIP5 


FEZ1 


HIP5 


CGI-74 


HIP5 


BAIP2" 


HIP5 


ALEX2 


ALEX2 


PIAsy 


MAGEH1 


KPNA2 


MAGEH1 


SETDB1 


CA150 


LUC7B1 


CA150 


HZFH 


CA150 


PIASy 


CA150 


PIASy 


hADA3 


BAIP1 


hADA3 


MAGEH1 


hADA3 


Ku70 


hADA3 


GIT1 


BARD1 


BAIP3 


BARD1 
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SETDB1 


BARD1 


CA150 


BARD1 


NAG4 


BARD1 


HIP15 


BARD1 


HIP5 


BARD1 


PTN 


BARD1 


FEZ1 


BARD1 


IKAP 


BARD1 


BAIP1 


BARD1 


mHAP1 


BARD1 


HB01 


BARD1 


BAIP2 


BARD1 


PLIP 


BARD1 


PIASy 


BARD1 


HZFH 


BARD1 


ZHX1 


BARD1 


HIP13 


HDexQ20 


CGI-125 


HDexQ20 


PFN2 


HDexQ20 


HP28 


HDexQ51 


HIP13 


HDexQ51 


HIP15 


HDexQ51 ' 


PFN2 


HDexQ51 


CGI-125 


HDexQ51 


LUC7B1 


GADD45G 


GDF9 


GADD45G 


PTN 


GADD45G 


BAIP3 


GADD45G 


G45IP2 


GADD45G 


HIP16 . 


GADD45G 


G45|P3 


GADD45G 


CGI-125 


GADD45G 


G451P1 


GADD45G 


HIP5 


GADD45G 


EF1G 


GADD45G 


EF1A 


GADD45G 


PLIP 


GADD45G 


PIASy 


GADD45G 


CGI-74 


GADD45G 


PTPK 


GADD45G 


MAPI Ic3 


PIASy 
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SUMO-2 


PIASy 


SUMO-3 


PIASy 


HIP16 


HD1.7 


DRP-1 


HD1.7 


HZFH 


HD1.7 


HIP13 


HD1.7 


CGI-125 


HD1.7 


HIP11 


HD1.7 


Ku70 


HD1.7 


IKAP 


HD1.7 


PFN2 


HD1.7 


FEZ1 


HD1.7 


GIT1 


HD1.7 


HIP5 


. HD1.7 


PIASy 


HD1.7 


GIT1 


HDdl.O 


IKAP 


HDdl.O 


FEZ1 


HDdl.O 


PIASy 


HDd1.3 


IKAP 


HDd1.3 


HZFH 


HDd1.3 


Ku70 


HDd1.3 


PIASy 


HIP2 


Ku70 


CLH-17 


HZFH 


mp53 


ZHX1 


mp53 


p53 


mp53 


PIASy 


mp53 


PLIP 


GAPD 


PIASy 


IMPD2 


EF1G 


EF1G 


HIP11 


EF1G 


HZFH 


TALI 


ZHX1 


. TAL1 


Ku70 


TCPG 


PIASy 


CLK1 



mHAP1 ZNF33B 
. ZHX1 ZNF33B 



. HZFH KPNB1 
PIASy KPNB1 
PTN KPNB1 
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ALEX2 


VIM 


SH3GL3 


VIM 


PIASy 


VIM 


HIP16 


VIM 


HZFH 


VIM 


HB01 


VIM 


BAIP1 


VIM 


DRP-1 


VIM 


G45IP1 


VIM 


MOV34 


VIM 


\f\KA 
VIM 


\/lM 
VIM 


NEFL 


VIM 


HSPC232 


VIM 


SETDB1 


VIM 


HIP15 


HD1.7 


HP28 


HDexQ20 
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Supplementary Table 1: List of DBD proteins for 1 d round of Y2H library screens 



.ID 


NAME 


ACCESSION 


aa MATCH 


PPIs 


BARD1 


BRCA1 associated ring domain protein 1 


Q99728 


1-379 


3 


CLH-17 


clathrin heavy chain 1 


Q00610 


1-289 


1 


CLK1 


protein kinase CLK1 


P49759 


209484 


1 


GADD45G 


growth arrest and DNA-damage-inducible protein 
GADD45 gamma 


095257 


18-159 


6 


hADA3 


ADA3 like protein 


075528 


235432 


1 . 


HD1.7 


huntingtin 


P42858 


1-606 


5 


HDdl.O 


huntingtin 


P42858 


1-320 


1 


HDd1.3 


huntingtin 


P42858 


166-506 


2 


HDexQ20 


huntingtin 


P42858 


1-90 


3 


HDexQ51 


huntingtin 


P42858 


1-82 


4 


HIP2 


ubiquitin conjugating enzyme E2-25 kDa 


P27924 


1-200 


1 


IMPD2 


inosine-6'-monophosphate dehydrogenase 2 


P 12268 


34-514 


1 


KPNB1 


karyopherin beta-1 subunit 


Q14974 


668-876 


1 


mp53 


cellular tumor antigen p53 (mouse) 


P02340 


73-390 


2 


TAL1 


transaldolase 


P37837 


3-337 


1 


TCPG 


T-rnmnlox nrntpin 1 nam ma suhunit 


P49368 


252-544 


1 


VIM 


vrimpnfin 
vji iici imi 


P08670 


1465 


6 


ZNF33B 


zinc finger protein 33b 


Q8NDW3 


527-778 


1 


14-3-3 


14-3-3 protein epsilon 


P42655 


no OCC 


AA 
AA 


DNAJ 


DnaJ homolog subfamily A member 1 


P31689 


1 13-379 


AA 


HD513Q68 


huntingtin 


P42858 




A A 

AA 


HIP1 


huntingtin interacting protein 1 


000291 


245-631 


AA 


mAP2A1 


ceadaptin A (mouse) 


P 17426 


697-971 


A A 

AA 


mAP2A2 


a-adaptin C (mouse) 


P 17427 


697-938 


AA 


mHAP 


huntingtin associated protein 1 (mouse) 


nieces 

035668 


<J A*74 

o-4/ 1 


AA 

AA 


RFA 


replication protein A 70 kua DNA-binding subunit 






AA 


onJoLo 


onj Cuniaining ur\D^ ukc piuifcjjn o 


Q99963 


3-347 


AA 




7MPOCQ 


075312 


29460 


AA 


ACTG1 


gamma-actin 


P02571 


182-375 




ALBU 


serum albimin precursor 


P02768 


1-249 




ALDA 


fructose-bisphosphate aidoiase A 


P04075 


1-363 


■ 


AM PL 


cytosol aminopepBdase 


P28838 


46487 




ARF4L 


ADP-ribosylation factor-like protein 4L 


P49703 


33-201 


- 


ASNS 


glutamine-dependent asparagine synthetase 


P08243 


318-560 




BCK 


creatine kinase, B chain 


P12277 


92-381 




CLH-17 ' 


clathrin heavy chain 1 


Q00610 


1165-1671 


- 


GAPDH 


glyceraldehyde 3-phosphate dehydrogenase 


P04406 


1-334 




HO-CT 


huntingtin 


P42858 


2721-3144 • 




LDHB 


L-lactate dehydrogenase b chain 


P07195 


96333 




MDHM 


malate dehydrogenase, mitochondrial precursor 


P40926 


1-338 




M0V34 


M0V34 isolog 


015387 


. 76-297 




NSFL1C 


p97 cofactorp47 


Q9UNZ2 


201-370 




PEBP 


phosphatidyiethanolamine-binding protein 


P30086 


1-186 




PHGDH 


D-3-phosphoglycerate dehydrogenase 


043175 


1-653 




PLD2 


phosphoiipase D2 


014939 


163336 




TIP49 


49 kDa TBF-interacting protein 


Q9Y265 


1456 




TRFE 


serotransferrin precursor 


P02787 


213-698 




TUBA1 


alpha-tubulin 1 


P05209 


1451 




TUBB4 


tubulin beta-4 chain 


Q13509 


113450 




UBC1 


polvubiquib'n C 


Q9UEF2 


1-685 





Abbreviations: aa, amino acids; DBD, DNA binding domain; PPIs, protein-protein interactions; AA, 
autoactivation of reporter gene. 
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Supplementary Table 2: Subcloned DBD proteins for 2 nd round of library screens 



Prow 
ri oy 


r\fc?ciouii iui ociv3L»iiuii 


PPIs 


HIP5 


huntingtin interacting protein verified by in vitro binding 
assay 


8 


PlASy 


huntingtin interacting protein verified by in vitro binding 
assay 


3 


CA150 


huntingtin interacting protein, literature verified 
interaction [HoibertS. etaLProc. NaVAcad. Scr. USA 
98,1811-1816(2001)] 


1 


EF1G 


part of ternary complex with EF1 A, which Is found in htt 
aggregates [Varrwetswinkel S. et al J 
8fo/.C/iem.278 b 43443-51 (2003)] 


1 


HYPA 


huntingtin interacting proteirv literature verified 
interaction [Faber, P.W\ etal. Hum. Mol. Genet.9, 1463- 
1474 (1998)] 


1 ' 


FEZ1 


huntingtin interacting protein verified by in vitro binding 
assay 


AA 


GIT1 


huntingtin Interacting protein verified by in vitro binding 
assay 


AA 


EF1A 


htt aggregate-interacting protein [Mitsui K. et al. J. 
A/eurosc/22,9267-9277 (2002)] 




HIP11 


huntingtin interacting protein verified by in vitro binding 
assay 




NEFL 
p53 


vimentin interacting protein, literature verified interaction 
[Carpenter; DA & Ip; W. J. Cell. SciAO, 2493-2498 
(1996)] 

huntingtin interacting protein, literature verified 
interaction [Steffan, J.S. et al. Proc. Natl. Acad. Sci. 
USA 97, 67636(2000)] 




PLIP 


BARD1 interacting protein, literature verified interaction 
fDechend, R. et al Oncogene^ 331 6-3323 (1999)] 





Abbreviations: DBD, DNA binding domain; PPIs, protein -protein interactions; AA, autoactivation of 
reporter gene. . . 
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Supplementary Tables: Reported interactions in Huntington's disease network 



Reported interactions, found 


Protein A 


Protein B 


Literature 


CA150 


HD1.7 

HDexQ20 

HDexQSI 


HoibertS. et al. Proc. Natl Acad. Sci. USA 98, 1811-1816 (2001). The Gin-Ala repeat transcriptional 
activator CA150 interacts with huntingtin: neuropathologic and genetic evidence for a role in Huntington's 
disease pathogenesis. 


HYPA 


HD1.7 

HDexQ20 

HDexQ51 


Faber, P.W. et al. Hum. MoL Genef.9, 1463-1474 (1998). Huntingtin interacts with a family of WW domain 
proteins. 


HIP1 


HD1.7 


Wanker, E.E.ef at. Hum. MoL Genet 3, 487-495 (1997). HIP-I: a huntingtin interacting protein isolated by the 
yeast two -hybrid system. 


SH3GL3 


HD1.7 

nuexwzu 

HDexQ51 


Sittler, A. et al. MoL Ce/fl, 427-436 (1998). SH3GL3 associates with the Huntingtin exon 1 protein and 
promaiBs xne Torrneuon or puiyyin-cunidiiinjg pruiuui ciyyioyaie;>. 


PlASy 


mp53 


Nelson, V., Davis, G.E. & Maxweil, S A ApoptosisZ, 221-234 (2001 ). A putative protein inhibitor of activated 
STAT (PlASy) interacts with p53 and inhibits p53-mediated transaction but not apoptosis. 


p53 


mp53 


Chene, P. O/7cogene20, 261 1-2617 (2001). The role of tetramerization in p53 function. 

Leblanc V. et al. Anal BiochemZQB, 247-64 (2002). Homogeneous time-resolved fluorescence assay for 

identifying p53 interactions with its protein partners, directly in a cellular extract 


PLIP 


BARD1 


Dechend, R et al. Oncogene!*, 33163323 (1999). The Bcl-3 oncoprotein acts as a bridging factor between 
l^-kappaB/Rel and nuclear co-regulators. 


SUMO-2 


PlASy 


Sachdev, S. et al Genes DevA 5, 3088- 3103 (2001). PlASy, a nuclear matrixassociated SUMO E3 ligase, 
represses LEF1 activity by sequestration into nudear bodies. 


SUMO-3 


PIASy 


Sachdev, S. etal. Genes Dev.15, 3088- 3103 (2001). PlASy, a nuclear matrix-associated SUMO E3 ligase, 
represses LEF1 activity by sequestration into nuclear bodies. J 


EF1G 


EF1G 


Mansilla, F. ef al. Biochem. J.365, 669^676 (2002). Mapping the human translation elongation factor eEF1H 
complex using the yeast two-hybrid system. 


NEFL 
VIMc 


VIM 


Carpenter, DA & Ip, W. J. Cell. SciAQ, 24935498 (1996). Neurofilament triplet protein interactions: 
evidence for the preferred formation of NFL -containing dimers and a putative function for the end domains. 


Reported interactions, not found 


Protein A 


Protein B 


Uterature 


HAP1 


HDexQ20 
HDexQ51 


U, S.H. ef al. J. Biol. Chem. 273, 19220-19227 (1998) A human HAP1 homologue. Cloning, expression, and 
interaction with huntingtin. 

Li, S.H.et al. J. NeurbsciAB, 1261-1269. (1998) interaction of huntingtirvassociated protein with dynactin 
P150Glued. 


HIP1 


CLH-17 


Henry, K.R. ef al. MoL Bio.l CellB, 2607-2625 (2002). Sod5p and clathrin function are important for cortical 
actin organization, endocytosis, and localization of sla2p in yeast, pnteriogs paper] 
Metzier, M. ef al. J. BtoL Chem. 276, 39271-39276 (2001). HIP1 functions in clathrimnediated endocytosis 
through binding to clathrin and adaptor protein 2. 

Waeiter, S. ef al. Hum. Mol. GeneMO, 1807-1817 (2001). The huntingtin interacting protein HIP1 is a 
clathrin and alpha-adaptin-binding protein involved in receptor-mediated endocytosis. 


P 53 


HDexQ20 
HDexQ51 


Steffan, J.S. et al. Proc. Natl. Acad. Sci. USA 97, 6763-6768 (2000). The Huntington's disease protein 
interacts with p53 and CREB-binding protein and represses transcription. 


P 53 


hADA3 


Wang, T. etal. BMBO J.20, 6404S413 (2001). hADA3 is required for p53 activity. 


p53 


' BARD1 


Irminger-Finger, I. et al. MoL Ce//6, 1255-1266 (2001). Identification of BARD1 as mediator between 
proapoptotic stress and p53-dependent apoptosis. 


KPNA2 


KPNB1 


Chook, Y.M. & Blobel, G. Cunr. Opln. Struct. S/o/,6, 703-715 (2001). Karyopherins and nuclear import 
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Supplementary Table Reported huntingtin interacting proteins 



ID 


NAME 


LOCUS ID 


PubMed ID 


Trprrcmpttonal control and DNA maintenance 


CA150 


transcription elongation regulator 1 (TCERG1) 


10915 


11172033 


CREB1 


cAMP responsive element binding protein 1 


1385 


8643525 


CREBBP 


CREB binding protein (Rubinstein-Taybi syndrome) 


1387 


10823891 


CTBP1 


CMerminal binding protein 1. 


1487 


11739372 


HYPA 


fcmnin binding protBin 3 (FNBP3) 


55660 


9700202 


HYPB 


huntingtin interacting protein B 


29072 


9700202 


HYPC 


huntingtin interacting protein C 


25766 


9700202 


NCOR1 


nuclear receptor cor epressor 1 


9611 


10441327 




nuclear factor of kappa light polypeptide gene enhancer in 


4790 




NFKB1 


B-ce!ls1(p105) 


12379151 


PQBP1 


polyglutamine binding protein 1 


10084 


10332029 


REST 


RE1 -silencing transcription factor 


5978 


1288172 


SAP30 


sin3-associated polypeptide, 30kDa 


8819 


10823891;10441327 


SP1 


Sp1 transcription factor 


6667 


11988536 


TAF4 


TAF4 RNA polymerase II 


' 6874 


11988536 


TBP 


TATA box binding protein 


6908 


10410676 


TP53 


tumor protein p53 (L>Fraumeni syndrome) 


7157 


10823891 


Cellular oraanization and protein transport , _ 


AP2A2 


adaptor-related protein complex 2, alpha 2 subunit 


161 


9700202 


DLG4 


discs, large homolog 4 (Drosophila) (PSD95) 


1742 


11319238 


HAP1 


huntingtin-associated protein 1 (neuroan 1) 


9001 


96681 10;9454836 


HIP1 


huntingtin interacting protein 1 


3092 


9147654 


HIP14 


huntingtin Interacting protein 14 


23390 


9700202;12393793 


OPTN 


optineurin (FIP2) 


10133 


9700202:11137014 


PACSIN1 


protein kinase C and casein kinase substrate in neurons 1 


29993 


12354780 


SH3GL3 


SH3-domain GRB2-like 3 


6457 


9809064 


SYMPK 


symplekin ' 


8189 


9700202 


TUBG1 


tubulin, gamma 1 


7283 


11870213 


Cell sianafinq and fate ■ _ 


GRAP 


GRB2-related adaptor protein 


10750 


8612237 


GRB2 


growth factor receptor-bound protein 2 


2885 


9079622 


rrpm 


inositol 1,4,5-triphosphate receptor, type 1 


3708 


12873381 


MAP3K10 


mitogen-activated protein kinase kinase kinase 10 ■ 


4294 


10801775 


PDE1A 


phosphodiesterase 1A, calmodulin -dependent 


5136 


8643525 


RASA1 


RAS p21 protein activator (GTPase activating protein) 1 


5921 


8612237: 9079622 


TGM2 


transglutaminase 2 


7052 


11442349 


TRIP10 


thyroid hormone receptor interactor 10 


9322 


12604778 


Cfillular metabolism 


CBS 


cystathionine-beta-synthase 


875 


9466992; 10434301; 10823891 


GAPD 


gtyceraldehyde3- phosphate dehydrogenase 


2597 


8612237 


TPH1 


tryptophan hydroxylase 1 


7166 


12354289 


Pmtein wnthpsk anrf tiimnvpr 


HIP2 


huntingtin interacting protein 2 


3093 


8702625; 9700202 


Uncharacterized proteins . 


HYPE 


huntingtin interacting protein E 


11153 


9700202 


HYPK 


huntingtin interacting protein HYPK 


25764 


9700202 


HYPM 


huntingtin Interacting protein HYPM 


25763 


9700202 


MAGEA3 


melanoma antigen, family A, 3 


4102 


9700202 



Abbreviations: ID, interacting protein gene symbol; LOCUS ID, NCBI LocusLink Identity; Pubmed ID, NCBI 
PubMed publication index; Reported htt interactors are presented according to databases: MINT, HPRD, BIND; 
Li&Li, Trends Genet. (2004), 20 146-152 and Harjes & Wanker, Trends. Biochem. Sci. (2003), 2 8, 425433. 
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Supplementary Table iff Protein-protein interactions of the extended HP network 



Number 


ID1 


LOCUSID 1 


ID 2 


LOCUSID 2 Reference 


Number 


ID 1 LOCUSID 1 


ID 2 LOCUSID 2 Reference 


1 


ABL1 


25 


CBL 


867 


literature 


63 


BRCA1 


672 


RBBP4 


592B 


literature 


2 


ABL1 


25 


PXN 


5829 


literature 


64 


BRCA1 


672 


RELA 


5970 


literature 


3 


ALEX2 


9823 


ALEX2 


9823 


this study 


65 


CA150 


10915 


LUC7B1 


55692 


this study 


4 


ALK 


238 


SHC1 


6464 


literature 


66 


CA150 


10915 


PIASy 


51588 


this study 


5 


AP2A2 


161 


SHC1 


6464 


literature 


67 


CBL 


867 


SRC 


6714 


literature 


6 


APP1 


333 


EF1A 


1915 


this study • 


68 


CBL 


867 


VAV1 


7409 


literature 


7 


APP1 


333 


BAIP1 


84289 


this study 


69 


CBL 


867 


SH3KBP1 


30011 


literature 


8 


APP1 


333 


GDF9 


2661 


this study 


70 


CBL 


867 


LAT 


27040 


literature 


9 


APP1 


333 


SETBDt 


9869 


this study 


71 


CBL 


867 


SHC1 


6464 


literature 


10 


APP1 


333 


HIP16 


10813 


this study 


72 


CBL 


867 


PIK3R1 


5295 


literature 


11 


APP1 


333 


BA!P3 


55791 


this study 


73 


CBL 


867 


PLCG1 


5335 


literature 


12 


APP1 


333 


HIPS 


57562 


this study 


74 


CBL 


867 


FYN 


2534 


literature 


13 


APP1 


333 


G45IP1 


84060 


this study 


75 


CBL 


867 


PTK2B 


2185 


literature 


14 


AR 


367 


EP300 


2033 


literature 


76 


CBL 


867 


EGFR 


1956 


literature 


15 


AR 


367 


ESR1 


2099 


literature 


77 


CDC2 


983 


PCNA 


5111 


literature 


16 


AR 


367 


RELA 


5970 


literature 


78 


CDC2 


983 


FYN 


2534 


literature 


17 


AR 


367 


BRCA1 


672 


literature 


79 


CGl-74 


51631 


H1P5 


57562 


this study 


18 


AR 


367 


HDAC1 


3065 


literature 


80 


CHUK 


1147 


1KB KB 


3551 


literature 


19 


AR 


367 


NCOA1 


8648 


literature 


81 


CLH-17 


1213 


HGS 


9146 


literature 


20 


AR 


367 


JUN 


3725 


literature 


82 


CLH-17 


1213 


Ku70 


2547 


this study 


21 


AR 


367 


NCOA3 


8202 


literature 


83 


CLK1 


1195 


PIASy 


51588 


this study 


22 


AR 


367 


STAT3 


6774 


literature 


84 


CREB1 


1385 


BRCA1 


672 


literature 


23 


AR 


367 


NR3C1 


2908 


literature 


85 


CREB1 


1385 


NR3C1 


2908 


literature 


24 


BA1P1 


84289 


G45IP3 




this study 


86 


CREBBP 


1387 


MSX1 


4487 


literature 


25 


BAIP3 


55791 


BAIP2 


84078 


this study 


87 


CREBBP- 


1387 


RELA 


5970 


literature 


26 


BAIP3 


55791 


HIP15 


114928 


this study 


88 


CREBBP 


1387 


RBBP4 


5928 


literature 


27 


BAIP3 


55791 


BAIP3 


55791 


this study 


89 


CREBBP 


1387 


PTMA 


5757 


literature 


28 


BAIP3 


55791 


HIP5 


57562 


this study 


90 


CREBBP 


1387 


PPARG 


5468 


literature 


29 


BARD1 


580 


PUP 


10524 


this study 


91 


CREBBP 


1387 


PML 


5371 


literature 


30 


BARD1 


580 


ZHX1 


11244 


this study 


92 


CREBBP 


1387 


MYOD1 


4654 


literature 


31 


BARD1 


580 


POU2F1 


5451 


literature 


93 


CREBBP 


1387 


JUN 


3725 


literature 


32 


BARD1 


580 


BRCA1 


672 


literature 


94 


CREBBP 


1387 


HNF4A 


3172 


literature 


33 


BARD1 


580 


CA150 


10915 


this study 


95 


CREBBP 


1387 


NR3C1 


2908 


• literature 


34 


BARD1 


580 


GIT1 


28964 


-this study 


96 


CREBBP 


1387 


EVI1 


2122 


literature 


35 


BARD1 


580 


IKAP 


8518 


this study 


97 


CREBBP 


1387 


KLF5 


688 


literature 


36 


BARD1 


580 


HB01 


11143 


this study 


98 


CREBBP 


1387 


SRC 


6714 


literature 


37 


BARD1 


580 


CDC2 


983 


literature 


99 


CREBBP 


'1387 


BCL3 


602 


literature 


38 


BARD1 


580 


NAG4 


29117 


this study 


100 


CREBBP 


1387 


TP53 


7157 


literature 


39 


BARD1 


580 


BAIP2 


84078 


this study 


101 


CREBBP 


1387 


BRCA1 


672 


literature 


40 


BARD1 


580 


PIASy 


51588 


this study 


102 


CREBBP 


1387 


WT1 


7490 


literature 


41 


BARD1 


580 


BAIP3 


55791 


this study 


103 


CREBBP 


1387 


NCOA3 


8202 


literature 


42 


BARD1 


580 


H1P5 


57562 


this study 


104 


CREBBP 


1387 


NCOA1 


8648 


literature 


43 


BARD1 


580 


SETBD1 


9869 


this study 


105 


CREBBP 


1387 


KHDRBS1 


10657 


literature 


44 


BARD1 


580 


BCL3 


602 


literature 


106 


CREBBP 


1387 


HIPK2 


28996 


literature 


45 . 


BARD1 


580 


HAP1 


9001 


this study 


\\}f 


CREBBP 


1387 


SREBF2 


6721 


literature 


46 


BARD1 


580 


PTN 


5764 


this study 


108 


CREBBP 


1387 


AR 


367 


literature 


47 


BARD1 


580 


HZFH 


1107 


this study 


109 


CTBP1 


1487 


HDAC2 


3066 


literature 


48 


BARD1 


580 


HIP15 


114928 


this study 


110 . 


CTBP1 


1487 


ZNFN1A1 


10320 


literature 


49 


BARD1 


580 


BAIP1 


84289 


this study 


111 


CTBP1 


1487 


HDAC1 


3065 


literature 


50 


BARD1 


580 


FEZ1 


9638 


this study 


112 


CTBP1 


1487 


EV11 


2122 


literature 


51 


BCL3 


602 


FYN 


2534 


literature 


113 


CTBP1 


1487 


BRCA1 


672 


literature 


52 


BCL3 


602 


RXRA 


6256 


literature 


114 


DLG4 


1742 


HGS 


9146 


literature 


• 53 


BCL3 


602 


JUN 


3725 


literature 


115 


DLG4 


1742 


FYN 


2534 


literature 


54 


BCL3 


602 


SHC1 


6464 


literature 


116 


DLG4 


1742 


PRKCA 


5578. 


literature 


55 


BRCA1 


672 


HDAC2 


3066 


literature 


117 


DLG4 


1742 


DNCL1 


8655 


literature 


56 


BRCA1 


672 


EP300 


2033 


literature 


118 


DLG4 


1742 


ERBB2 


2064 


literature 


57 


BRCA1 


672 


ESR1 


2099 


literature 


119 


DRP-1 


1400 


Huntingtin 


3064 


this study 


58 


BRCA1 


672 


CDC2 


983 


literature 


120 


DRP-1 


1400 


VIM 


7431 


this study 


59 


BRCA1 


672 


HDAC1 


3065 


literature 


121 


EF1A 


1915 


GADD45G 


10912 


this study 


60 


BRCA1 


672 


STAT3 


6774 


literature 


122 


EF1A 


1915 


PLCG1 


5335 


literature 


61 


BRCA1 


672 


JUN 


3725 


literature 


123 


EF1G 


1937 


EF1G 


1937 


this study 


62 


BRCA1 


672 


MYC 


4609 


literature 


124 


EF1G 


1937 


GADD45G 


10912 


this study 
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Number 


1D1- LOCUSID 1 


ID 2 LOCUSID 2 Reference 


Number 


ID 1 


LULrUolU 1 


ID 2 LOCUSID 2 Reference 


125 


borK • 


IbOO 




Of 14 


nieraiure 


A on 


oil 1 


9RQftd 


BA1P3 


55791 


this study 


126 


EGFR 


1956 


PTK2 


5747 


literature 


191 


oin 


OQQCA . 

zoyo^t 


G45IP3 




thfc sturlv 

u no oiiiuy 


127 


EGFR 


1956 


PLCG1 


coon 
boob 


literature 


192 


\j\ 1 1 


OQQfM 


HIPS 


57562 


thfc <?tifrfv 

u no oiuuy 


128 


EGFR 


190D 


rllvJKj 




literature 


193 


oil 1 


zoyo^t 


PXN 


5829 


literature 


129 


EGFR 


1956 


ERBB2 


2064 


literature 


194 


ol 1 1 




PTK2 


5747 




130 


EGFR 


1956 


PDGFRB 


5159 


literature 


195 


PDAD 


lUf ou 


EPOR 


2057 


Irtpraturp 

11 Lw 1 QiUI w 


131 


EGFR 


1956 


PTK2B 


2185 


literature 


196 


no AD 
brVAr 


1 U/ OU 


TNFSF6 


356 


litprafi irp 

IJ LCI QIUI C 


132 


bbrK 


1 yob 


torn 


zuyy 


literature 


197 


r>DAD 

bKAr 


1fY7^ft 
lUf OU 


KIT 


3815 


literature 


133 


EGFR 


1956 


SHC1 ■ 


6464 


literature 


198 


uKAr 


1 u / ou 


3051 


.6654 


litpratiirfi 


134 


EGFR 


1956 


SOS1 


DO04 


literature 


199 


uKAr 


in7^n 
\ u/ ou 


LAT 


27040 


litpraturp 


135 


EP300 


2033 


ING1 


3621 


literature 


2U0 


OrvDZ 


OQQC 
ZOO o 


TP73L 


8626 


litpratnrp 

II LCI CILUI v7 


136 


EP300 


2033 


IML>UA1 


0D40 


literature 


201 


OTCD2 


ZOOO 


PLCG1 


5335 


literature 


137 


EP300 


2033 


HNF4A 


3172 


literature 


202 


oKt>2 


2ooo 


i 1 rvz 


5747 


Htprati irp. 

11 LCI CI LU 1 c 


138 


EP300 


2033 


MDIV12 


4193 


literature 


203 


VjKd2 


ZOOO 


SHC1 


6464 


Ittpratiirp 

IILCI OLUI C 


139 


EP300 


2033 


PCNA 


PJ A A 

5i n 


literature 


204 




ZOOO 


o^JO t 


6654 


lifpratnrp 

II LCI OLUI O 


140 


EP300 


2033 


PTMA 


C7C7 


literature 


205 


oKb2 


zooo 


LAT 


27040 


lirprflti i rft 

UlV/l VJ LU 1 W 


141 


EP300 


2033 


RELA 


5970 


literature 


206 


(jKB2 


2oo5 




6714 


Ittpraturp 

1 1 LCI CI LU 1 w 


142 


EP300 


2033 


STAT3 


6774 


literature 


207 


oKD2 


zooo 


VVnO 


7454 


litpmtt irp 

MICI OLUI C 


143 


EP300 


2033 


ESR1 


2099 


literature 


208 


V3KD2 


20OO 


VWACI 


RQ7R 


literature 


144 


EPOR 


2057 


KIT 


3815 


literature 


209 


GRB2 


2885 


i^WnRRQI 
l\nUr\DO 1 


10657 


ISfprati irp 

IILCJ OLUI c 


145 


EPOR 


2057 


SHC1 


o4o4 


literature 


210 


pnno 


2OO0 


SH3KBP1 


30011 


literature 

IILWI QLUI W 


146 


EPOR 


2057 


VAV1 


7409 


literature 


Zll 




Zoou 


PIfORI 


5295 


litpratiira 

IILCI OLUI C 


147 


rnnn 
EPOR 


2057 




5Za5 


literature 




oKt52 


ZOOO 


RASA1 


5921 


literature 


148 


ERBB2 


2064 


PTK2 


5747 


literature 


04 0 

213 


(jKd2 


2000 


V/AV1 
VnV 1 


7409 


llfprati irp 

ULCI ulL/l C 


149 


ERBB2 


2064 


SHC1 


6464 


literature 


OH A 

214 


oKti/ 


zooo 


DJri\ 




litprati irp 


150 


ERBB2 


2064 


PTK2B 


2185 


literature 


215 


CjRo2 


OQQC 

ZOOO 


ARI 1 
ADL 1 


25 


litprpti irp 

IILCI ulUl C 


151 


ERBB2 


2064 


SOol 


OOD4 


literature 


215 


oKKd 


ZOOO 


TNFSF6 


356 


literature 


152 


ESR1 


2099 


JUN 


3725 


iiteraiure 


217 


bKbZ 


OQQC 

2ooo 


PDGFRB 


51 59 


literature 

iilci ami c 


153 


ESR1 


2099 


MDM2 


4193 


literature 


21 o 


oKD2 


ZOOO 


UtNlvi ( 


1759 


lifprpti irp 

liLci aiui c 


154 


ESR1 


2099 


PIK3R1 


5295 


literature 


OHO 

219 


GRB2 


ZOOO 




2057 


literati irp 


155 


ESR1 


2099 


SHC1 


6464 


literature 


220 


GKB2 


OQQC 

ZOOO 


ERBB2 


2064 


litprattJrp 

ii lci aiui c 


156 


ESR1 


2099 


NCOA3 


8202 


literature 


221 


GRB2 


2685 


prrv"9R - 


2185 


litprati irp 
HLCi aiui c 


157 


ESR1 


2099 


NCOA1 


8648 


literature 


222 


GRB2 


ZOOO 


URAC 

nrvAO 


3265 


Iftpraturp 

IllCt CILUI C 


158 


EVI1 


2122 


HDAC1 


once 


literature 


ooo 
223 


oKb2 


ZOOO 


KIT 


3815 


literature 


159 


FEZ1 


9638 


MMr 


\ flQQQ 

luyoy 


inis siuoy 


OOyl 

224 




ZOOO 


CBL 


867 


literature 


160 


FEZ1 


9638 


BAIP3 


55791 


this study 


225 


oKb2 


zooo 




2260 


literature 

11 LCI UlUl W 


161 


FEZ1 


9638 


(J me 


570DZ 


this study 


22o 


flAUAJ 




EP300 


2033 


literature 


162 


FEZ1 


9638 


G45lro 




this study 


007 


nAUAO 




TP53 


7157 


literature 


163 


FGFR1 


2260 


SHC1 


CMC/1 

6464 


literature 


228 


hADA3 


Hn»7/i 
1U4Y4 


BAIP1 


84289 


this study 


164 


FYN 


2534 


VAV i 


7/tno 


literature 


OOO 

Z2y 


nAUAo 


10A74. 


PIASy 


51588 


this study 


165 . 


FYN 


2534 


bnUl 


o4o4 


literature 


00.0 


llrtUnJ 


10474 
lUf r *f 


MAGEH1 


28986 


this study 


166 


FYN 


2534 


IvnUKbol 


lUOO/ 


literature 


004 


nAUAO 


10474 
1 U*tf *f 


ESR1 


2099 


literature 


167 


FYN 


2534 


WAS 


7454 


literature 


232 


LJADH 

HAP1 


yuoi 


BAIP3 


55791 


thfc ^nriv 

U HO O LUVJ J 


168 


FYN 


2534 


PDGFRB 


5159 


literature 


OOQ 


MAPI 
nArl 




HGS 


9146 


litprahirp 

IILCI CILUI w 


169 


FYN 


2534 


PIK3R1 


5295 


literature 


oox 
2o4 


nArl 


yuu i 


HIPS 


57562 


thfc <;tijriv 

U IIO OlUUjf 


170 


FYN 


2534 




oooo 


literature 
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Number 


ID 1 


LOCUSID 1 


ID 2 
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Abbreviations: ID, interacting protein gene symbol; 
LOCUS ID, NCBI LocusLink Identity. The presented 
list of protein -protein interactions is computed from 
databases: MINT, HPRD, BIND; Li & Li, Trends 
Genet (2004), 20, 146-152 and Harjes & Wanker, 
Trends. Biochem. Sc/. (2003), 28, 425-433. 
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The figures show: 

Figure 1 Identification of two-hybrid interactions connected to HD. a, Schematic 
representation of the screening strategy, b, Identification of interactions by systematic 
interaction mating. Upper panel: Selection of diploid yeast clones by transfer on 
minimal medium lacking leucine and tryptophan (SDH). Lower panel: Two-hybrid 
selection of interactions on minimal medium lacking leucine, tryptophan, histidine and 
uracil (SDIV) after 5 days of growth at 30°C. The prey proteins HP28 (A5), SH3GL3 
(A7), CA150 (B9), HIP15 (B10), PFN2 (B11), HIP13 (C1), CGI125 (C12) and HYPA 
(D1) were identified as HDexQ51 interactors. 

Figure 2 Protein interaction network for Huntington' s disease, a, Matrix of 117 two- 
hybrid interactions between 21 bait and 49 prey proteins, b, Yeast two-hybrid 
interactions depicted as network using the software Pivot 1.0. In total, 96 interactions 
and 61 distinct proteins are depicted. In addition, dimers of EF1G, VIM and p53 are 
shown. 

Figure 3 Systematic validation of two-hybrid interactions by in vitro binding 
experiments. GST-fusion proteins (baits) immobilised on glutathione agarose beads 
were incubated with COS1 cell extracts containing HA-tagged prey proteins. After 
extensive washing of the beads, bound proteins were eluted and analysed by SDS- 
PAGE and immunoblotting using anti-HA antibody. 

Figure 4 Identification of network proteins stimulating htt aggregation, a, Filter 
retardation assay. Protein extracts were prepared from HEK293 cells coexpressing 
HD169Q68 and network proteins as indicated. The aggregated proteins retained on 
the filter were detected with anti-htt antibody (CAG53b) and anti-GIT1 antibody, b, 
Coimmunoprecipitation of HD510Q68 arid GIT1 from COS1 cell extracts. Extracts 
were incubated with anti-GIT1 or preimmune serum. Immunoprecipitated material 
was analysed by immunoblotting using htt- antibody 4C8 and anti-HA antibody, c, 
Coimmunoprecipitation of htt and GIT1 from human brain extracts. Protein 
complexes containing GIT1 were pulled-down with increasing amounts of anti-htt 
antibodies, but not with corresponding preimmune sera, d, Analysis of subcellular 
localisation of HD510Q68 and GIT1 by immunofluorescence microscopy. COS1 cells 
were transfected with the indicated constructs and immunolabled with 4C8 anti-htt 
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antibody coupled to Cy3-conjugated antibody (red) and with anti-HA antibody 
coupled to FITC-conjugated antibody (green). Nuclei were counterstained with 
Hoechst (blue). Colocalisation of HD510Q68 and GIT1 is illustrated by yellow colour 
of the insoluble aggregates. Scale bars, 10 urn. 

Figure 5 Detection of GIT1 in brains of R6/1 transgenic mice and HD patients, a, 
Sections of striatum and cortex of R6/1 mice brains labelled with anti-GIT1 and anti- 
htt (EM48) antisera. Arrows point to nuclear inclusions, b, Inclusions in cortex of HD 
patients are labelled with anti-htt (2B4) and anti-GIT1 antibodies. Arrows indicate 
neuronal inclusions, recognized by anti-htt (2B4) and anti-GIT1 antibodies. Scale 
bars, 20 urn. c, Colocalisation of GIT1 and htt in the cortex of HD patients detected 
by immunofluorescence microscopy. 

Figure 6 Amino acid sequence of the interacting proteins of the PPI of huntingtin. 

Figure 7 Identification of Y2H interactions connected to HD. A, The screening 
strategy. B, Identification of interactions by systematic interaction mating. Upper 
panel: Selection of diploid yeast clones on SDN minimal medium. Lower panel: Two- 
hybrid selection of interactions on SDIV minimal medium. The prey proteins HP28 
(A5), SH3GL3 (A7), CA150 (B9), HIP15 (B10), PFN2 (B11), HIP13 (C1), CGI125 
. (C12), and HYPA (D1) were identified as HDexQ51 interactors. 

Figure 8 A protein interaction network for Huntington' s disease. A, Matrix of 186 
Y2H interactions between 35 bait and 51 prey proteins. Interactions reported 
previously (30), or verified in pull down assays (35) are indicated. B, A 
comprehensive PPI network for htt. Y2H interactors identified in this study (red 
diamonds), previously published interactors (blue squares), interactors identified from 
databases HRPD, MINT and BIND, bridging any two proteins in the extended 
network (green triangles, Suppl. Table 5). Htt interactors previously reported and 
found in our screens (CA150, HYPA, HIP1 , and SH3GL3), depicted as red squares. 

Figure 9 Validation of Y2H interactions by in vitro binding experiments. GST-fusion 
proteins immobilized on glutathione agarose beads were incubated with COS-1 cell 
extracts containing HA-tagged proteins. After extensive washing, pulled proteins 
were eluted and analyzed by SDS-PAGE and immunoblotting using anti-htt 4C8 or 
anti-HA antibodies. 
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Figure 10 GIT1 enhances and is critical for htt aggregation. A, Filter retardation 
assay for the identification of GIT1 as a promoter of htt aggregation. 48 h post 
transfection, protein extracts were prepared from HEK293 cells coexpressing 
HD169Q68 and GIT1-CT (aa 249-770). Aggregated proteins retained on the filter 
were detected with anti-htt (CAG53b) or anti-C-GIT1 antibody. B, Effect of full-length 
GIT1 on HD169Q68 aggregation analyzed by the filter retardation assay. C, Analysis 
of HD169Q68 aggregation in cells overexpressing GIT1-CT by indirect 
immunofluorescence microscopy, a, HD169Q68 (red), b, GIT1-CT (green), c, 
Colocalization of GIT1 with the endosomal marker EEA1 is indicated in yellow, d-f, 
> Colocalization of HD169Q68 (red) and GIT1-CT (green) in COS-1 cells. D, Silencing 
of endogenous GIT1 expression. HEK293 cells transfected with the siRNA-GIT1 
were analyzed after 48 h by immunoblotting using anti-C-GIT1 and anti-GAPDH 
antibodies. E, Silencing of endogenous GIT1 prevents the accumulation of insoluble 
htt aggregates. siRNA-GIT1 treated and untreated cells expressing HD169Q68 were 
analyzed 72 h post transfection by filtration. 

Figure 11 Verification of the htt-GIT1 interaction. A, Coimmunoprecipitation of 
HD510Q68 and HA-GIT1-CT from COS-1 cell extracts using anti-C-GIT1 antibody. 
Immunoprecipitated material was analyzed by immunoblotting, using the anti-HA 
12CA5 antibody detecting recombinant GIT1 (upper blot) and the htt-4C8 antibody 
(lower blot). B, Coimmunoprecipitation of htt and GIT1 from human brain extracts. C, 
Subcellular localization of GIT1 and htt in differentiated PC12 cells (a-c) and SH- 
SY5Y cells (d-f) by confocal immunofluorescence microscopy. Colocalization of htt 
and Gl'i I shown in yellow (panel c and f). Arrow points to cytoplasmic structures 
recognized by both antibodies. In addition, specific GIT1 labeling was detected at the 
tip of neurite-like extensions in adhesion foci (arrowheads). Scale bars, 10 urn. 

Figure 12 Detection of GIT1 in brains of transgenic mice and HD patients. A, 
Sections of striatum and cortex of R6/1 mice brain labeled with anti-C-GIT1 and anti- 
htt EM48 antibodies. Arrows point to nuclear inclusions. B, Neuronal inclusions 
(arrows) in cortex of HD patients recognized by anti-htt 2B4 and anti-C-GIT1 
antibodies. Scale bars, 20 urn. C, Colocalization of GIT1 and htt in the cortex of HD 
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patients, detected by immunofluorescence microscopy. D, Detection of N-terminally 
truncated GIT1 degradation products in HD patient brain cortex. 

Figure 13 Specificity of GIT1 antibodies. A, Scheme indicating the regions of GIT1, 
which were used for the production of antibodies. NT-GIT 1 antibody recognizes the 
N-terminal part (aa 1-100), C-GIT1 the central part (aa 368-587) and CT-GIT1 the C- 
terminal part (aa 664-770) of GIT1. B, Analysis of the specificity of the GIT1 
antibodies. All three antibodies specifically recognize GIT1 , but not the homologous 
protein GIT2 (Premont et al, 2000). After expression of full length HAGIT1 and HA- 
GIT2 15 ug of total COS-1 cell extract was subjected to SDS-PAGE. Immunoblotting 
i was performed with anti-NT-GIT1 (1:500), anti-C-GIT1 (1:500) and anti-CT-GIT1 
(1:500) antibodies. Expression of HA-GIT1 and HA-GIT2 was detected with an anti- 
HA antibody (1:1 000). 
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The examples illustrate the invention: 

PART I: ESTABLISHING THE PROTEIN-INTERACTION NETWORK OF 
HUNTINGTIN 

Examples 1 : Particular methods and material used in the Examples 

• Antibodies, strains and plasmids 

A polyclonal antibody (pAb) against GIT1 was generated by injection of affinity 
purified His 6 -tagged GIT1 (residues 368-587) into a rabbit. The htt-specific pAb 
CAG53b and HD1 were described 13,14 . Commercially available antibodies were anti- 
GST pAb (Amersham Pharmacia), anti-GIT1 pAb (Santa Cruz Biotechnology), anti- 
HA monoclonal antibody 12CA5 (mAb) (Roche Diagnostics), anti-htt pAb EM48 47 , 
anti-htt mAb 2B4 48 and anti-htt mAb 4C8 (Chemicon). As secondary antibodies for 
immunofluorescence microscopy Cy3- and FITC-conjugated IgGs (Jackson 
ImmunoResearch) were used. The yeast strains used as two-hybrid reporters were 
L40ccua [MATa his3A200 trp1-901 Ieu2-3,112 LYS2::(lexAop) 4 -HIS3 ura3::(lexAop) 8 - 
lacZ ADE2::(lexAop) 8 - URA3 GAL4 gal80 can1 cyh2] and L40ccoc [MATa his3A200 
trp1-910 Ieu2-3,112 ade2 LYS2::(lexAop) 4 -HIS3 URA3::(lexAop) 8 -lacZ GAL4 gal80 
can1 cyh2]. Both strains are derivatives of L40c 17 . Plasmids pHD510Q17 and 
pHD510Q68 were generated by insertion of fragments coding for HD510Q17 and 
HD510Q68 into pcDNA-l (Invitrogen). pHD169Q68 was derived from pHD510Q68 by 
deletion of the Xhol- Xhol fragment encoding aa 170-510 of human htt. 

• Library screening 

Plasmids encoding bait proteins were transformed into the strain L40ccua, tested for 
the absence of reporter gene activity and cotransformed with a human fetal brain 
cDNA library (Clontech). For each transformation 1 x 10 6 independent transformants 
were plated onto minimal medium lacking tryptophan, leucine, histidine and uracil 
(SDIV medium) and incubated at 30°C for 5 to 10 days. Clones were picked into 
microtitre plates using a picking robot and grown over night in liquid minimal medium 
lacking tryptophan and leucine (SDII medium). Then, they were spotted onto nylon or 
nitrocellulose membranes placed on SDIV medium plates. After incubation for 4 days 
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membranes were subjected to a (3-galactosidase (p-GAL) assay. Plasmids were 
prepared from positive clones and characterised by restriction analyses and 
sequencing. For retransformation assays plasmids encoding bait and prey proteins 
were cotransformed in the yeast strain L40ccua and plated onto SDIV medium. 

• Array mating screen 

Plasmids encoding bait and prey proteins were transformed into strains L40ccua and 
L40cca, respectively. L40cca clones were arrayed in 96-weII microtitre plates and 
mixed with a single L40ccua clone for interaction mating. Diploid cells were 
transferred by a robot (Beckman, Biomek® 2000) onto YPD medium plates and, after 
incubation for 24 h at 30°C, onto SDN medium plates for additional 72 h at 30°C. For 
two-hybrid selection diploid cells were transferred onto SDIV medium plates with and 
without nylon or nitrocellulose membranes and incubated for 5 days at 30°C. The 
nylon or nitrocellulose membranes were subjected to the (3-GAL assay. Positive 
clones were verified by cotransformation assays using plasmids encoding respective 
bait and prey proteins. 

• Protein expression and verification assays 

For verification experiments cDNA fragments encoding baits and preys were 
subcloned into pGEX derivatives (Stratagene) or pTL-HA 18 . GST fusion proteins 
were expressed in E. coli BL21-codon PlusTM RP (Stratagene) and affinity purified 
on glutathione agarose beads (Sigma) using standard protocols 17 . COS1 cells were 
transfected with mammalian expression plasmids and lysed as described 18 . For in 
vitro binding assays, 30 \ig of GST or GST fusion protein were immobilized on 
glutathione agarose beads and incubated with 500 \sg protein extract prepared from 
COS1 cells expressing a HA-tagged fusion protein for 2 h at 4°C in binding buffer [50 
mM HEPES pH 7.4, 150 mM NaCI, 10% glycerol, 1 % NP-40, 1 mM EDTA, 20 mM 
NaF, 1 mM DTT, 0.1 % Triton X-100, protease inhibitors (Roche Diagnostics)]. After 
centrifugation and extensive washing of the beads bound proteins were eluted and 
analysed by SDS-PAGE and Western blotting. Coimmunoprecipitation experiments 
were performed as described by Sittler ef a/. 18 . For immunofluorescence microscopy 
COS1 cells were grown on cover slips and cotransfected with pcDNA-HD510Q68 
and pTL-HA-GIT1. 40 h post transfection cells were fixed with 2% paraformaldehyde. 
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Standard protocols for staining with appropriate primary and secondary antibodies 
were used 18 . 

• Filter Retardation Assay 

HEK293 cells coexpressing HD169Q68 and GIT1, PIASy, HIP5, HP28, PFN2, FEZ1 
or BARD1 were harvested 48 h post transfection. Cells were lysed as described 18 
and boiled in 2% SDS, 100 mM DTT for 5 min. Aliquots containing 50, 25 and 12.5 
pg of total protein were used for filtration on a cellulose acetate membrane 14 . SDS- 
resistant aggregates were detected using anti-CAG53b or anti-GIT1 antibodies. 

• Immunocytochemistry 

Mice were deeply anaesthetised and perfused through the left cardiac ventricle with 
4% paraformaldehyde in 0.1 M phosphate buffer. Brains were removed and postfixed 
overnight in 4% paraformaldehyde. Sections were processed for 
immunocytochemistry as described 47 . pAb EM48 (1:1000) and affinity purified anti- 
GIT1 pAb (1:100) were used as primary antibodies. 

Six human HD and 5 control brains were used in this study. Two HD cases were 
classified as grade 3 and four cases as grade 4 of neuropathological severity. For 
immunolabelling standard protocols were used 48 . 2B4 mAb (1:200) and affinity 
purified GIT1 pAb (1:50) were used as primary antibodies. 

Example 2: Two-hybrid screens and data management 

To generate a PPI network for HD we used a combination of library and matrix yeast 
two-hybrid screens (Fig. 1a). First, 50 selected cDNAs encoding proteins potentially 
involved in HD including 10 different htt fragments were cloned into a DNA binding 
domain vector for expression of LexA fusion proteins (baits). The resulting plasmids 
were introduced into yeast strain L40ccua, which carries three reporter genes, H/S3, 
URA3 and lacZ, for two-hybrid interaction analyses. Forty baits did not activate the 
reporters by themselves and were used individually for cotransformation screening of 
a human fetal brain cDNA library expressing GAL4 activation domain hybrids (preys). 
In each screen, 1 x 10 6 auxotrophic transformants were tested on selective plates, 
and 1-50 positive colonies were typically obtained. Restriction analyses and 
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sequencing identified preys that together with their respective baits repeatedly 
activated the reporter genes. Starting with 40 baits in the first round of 
cotransformation screens we identified 34 PPIs for 10 baits (Table 1). 

In the second round of screens, 12 cDNA fragments encoding preys identified in the 
first screen were subcloned into a DNA binding domain vector. The resulting baits 
were tested for autoactivation and 10 were screened against a human fetal brain 
cDNA library. Four of the 10 proteins revealed additional 13 PPIs. 

Finally, an array mating screen was performed to connect all baits and preys 
identified in the transformation screens. For this assay, MATa yeast cultures were 

• transformed with plasmids encoding prey proteins and arrayed in 96-well microtitre 
plates for interaction mating with individual MATa strains expressing bait proteins. 
Using this strategy each bait was individually tested for interaction with every prey in 
the array. Diploid yeast clones, formed by mating on YPD plates, were selected on 
agar SDII plates, and further transferred by a spotting robot on SDIV plates to select 

; for Y2H interactions (Fig. 1 b). We examined 3500 pairwise combinations of baits and 
preys in the mating assay and identified additional 70 PPIs. These interactions could 
be confirmed in cotransformation assays (Table 5). 

Table 5: 



Summary of two-hybrid screens 





baits 


preys 


baits yielding 


interactions 


Screen 


screened 


screened 


interactions 


identified 


Isttransfr mation screen 


40 


4x1 0 7 


10 


34 


2nd transformation screen 


10 


1x10 7 


4 


13 


Array mating screen 


50 


70 


21 


70 



Thus, the combination of cDNA library and array mating screens proved powerful in 
establishing a highly connected PPI network linked to htt. 



WO 2004/113566 



78 



PCT/EP2004/006617 



Sequence analysis of the cDNAs encoding bait and prey proteins revealed ORFs 
ranging from 82 to 728 amino acids in size (Table 2). In a systematic Blast search 60 
out of the 67 proteins identified were identical to a SwissProt or TrEMBL protein entry 
(http://us.expasy.org/sprot/). The remaining 7 proteins showed 75-99 % identity to its 
best fit and either contained single amino acid substitutions, variable polyQ lengths or 
small regions of sequence variation. Uncharacterised proteins were named according 
to their interaction partners. Each ORF was further examined for consensus protein 
domains using the FprintScan, HMMPfam, HMMSmart, ProfileScan, and 
BlastProDom programs providing useful hints to protein function. For example, the 
protein BAIP1 (BARD1 interacting protein 1) possesses a Zn-finger-like PHD finger 
that is believed to be important for chromatin-mediated transcriptional regulation. 
Similarly, domain searches for BAIP2 (BARD1 interacting protein 2) revealed a 
BTB/POZ domain, a motif found in developmentally regulated zinc finger proteins of 
the Kelch family of actin-associated proteins. Thus, BAIP2 could potentially mediate 
the association of BARD1 with the actin cytoskeleton. 

Example 3: Analysis and functional assignment of the two-hybrid data 

Our two-hybrid screens identified a total of 117 PPIs between 70 protein fragments. 
As a result of the iterative two-hybrid strategy all interactions could be depicted in a 
single large network. The number of interactions identified for each bait varied from 1 
to 18, with each protein having 1.6 interaction partners on average! In order to 
display the PPI data, both matrix and network representations were used (Fig. 2). 
The matrix shows, in addition to the two-hybrid interactions, previously reported 
interactions and interactions verified by independent methods (Fig. 2a). In 
comparison, the network view allows to immediately recognize local PPI patterns and 
paths connecting two proteins in the network (Fig. 2b). Interestingly, proteins such as 
htt, BARD1, GADD45G, HIP5, PIASy or VIM interact with more than 11 other 
proteins forming nodes within the HD network, while 30 proteins have only one 
interaction partner and thus are located at the periphery of the network (Fig. 2b). 
Indeed, all other proteins a.e embedded in many bi-fan motifs and multiple circular 
interaction clusters that have been interpreted to be an indication for biological 
relevance 11,19 . Schwikowski et al. 20 defined network proteins, which are separated 
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by no more than two other proteins, as being part of a functional cluster. In this 
respect all proteins in our network form a functional cluster with htt. 

We assigned a subcellular localisation to each protein by examining various sources 
of literature and based on available experimental data we grouped the proteins into 
six broad functional categories (Fig. 2a, Table 2). 

Eighteen proteins in the HD network are involved in transcriptional regulation or DNA 
maintenance (Fig. 2a). The second largest group, 14 proteins, includes mainly 
cytoskeietal and transport proteins. We assigned 5 proteins to cellular signalling and 
fate, another 4 proteins to protein synthesis and turnover, and 3 proteins to cellular 
metabolism. Being part of 41 interactions, 16 proteins of unknown function were 
identified. 

For the analysis of htt PPIs, as much as 40 out of 1 17 interactions (34,2%) included a 
htt fragment (Fig. 2a). In total, 19 different htt interacting partners from various 
functional groups were detected, 4 proteins had been previously described and 6 
involved proteins of unknown function. Surprisingly, most htt partners (6) are involved 
in transcriptional regulation and DNA maintenance, but others function in cell 
organization and transport (4), cellular signalling (2), or cellular metabolism (1), 
suggesting that htt functions in different subcellular processes. 

The current hypothesis that htt has a function in transcriptional regulation is inferred 
from its interactions with transcriptional activators, coactivators or repressors 21 . In 
agreement with previous reports, binding of htt to CA150 22 and HYPA 23 has been 
detected in our screens. In addition, new connections to nuclear proteins such as 
SETBD1, PLIP and HB01 were found. These multidomain proteins act on histones 
and are known modulators of chromatin structure and gene expression. Similarly, the 
zinc finger bromo domain containing proteins BARD1, NAG4, HZFH, ZHX1, ZNF33B 
play a role in transcriptional control. The protein IKAP directly interacts with htt and 
was recently shown to be part of a complex regulating RNA polymerase II activity 24 . 
Htt also interacts with PIASy, which inhibits transcription factor STAT-mediated gene 
activation 25 . PIASy functions as SUMO E3 ligase for the Wnt-responsive 
transcription factor LEF1, inhibiting its activity via sumoylation 26 . This suggests that 
PIASy catalysed sumoylation of transcription factors could represent a general 
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mechanism in repression of gene expression. The binding of PIASy to htt indicates 
that htt may itself be a substrate for sumoylation. Alternatively, it could influence the 
sumoylation of other transcription factors. Thus, our data extend the nuclear role of 
htt and provide additional leads for its involvement in transcriptional regulation. 

Another large group of htt interactors identified here are proteins that function in 
cellular organization and vesicle transport. We report a new interaction between htt 
and dynein light chain (HP28), a component of the dynein/dynactin motor protein 
complex. Interestingly, the p150 Gk,ed subunit of dynactin is linked to the htt-associated 
protein HAP1 16,27 . Our observation that htt directly binds to HP28 underscores the 
potential scaffolding role of htt/HAPl in dynein/dynactin driven retrograde vesicle 
transport along microtubules in axons. 

The htt interacting protein HIP1 anchors clathrin-coated vesicles to the cytoskeleton 
via its actin-binding domain, a link crucial for synaptic vesicle endocytosis 28 . Here, a 
new PPI between htt and profilin II (PFN2) 29 was detected. PFN2, a protein enriched 
in neurons, modulates actin polymerization in vitro and is involved in endocytosis via 
association with scaffolding proteins 29 . The htt-PFN2 connection adds support to a 
potential role of htt in modulation of both actin polymerization and vesicle transport 
processes. 

Currently, for the function of 6 htt interactors, including HIP5, no genetic or 
biochemical evidence is available (Table 2). We found that HIPS binds to htt as well 
as to karyopherin a (KPNA2). KPNA2 serves as an adapter for karyopherin (3 
(KPNB1), which transports NLS-tagged proteins into the nucleus 30 . Thus, HIPS 
might take this route to the nucleus. Interestingly, HEAT or armadillo (ARM) repeats, 
forming a-helical structures in KPNA2 and KPNB1 are also present in htt 31 . 
Therefore, the complexes between KPNA2 and HIPS as well as between htt and 
HIPS could be similar in terms of protein structure. It is tempting to further speculate 
that htt participates in nucleocytoplasmic transport. 

Example 3:Verification of PtMs 

Comparison with literature-cited interactions revealed that more than 80% of the two- 
hybrid interactions identified here are novel. For all network bait and prey proteins 
only 24 PPIs have been reported previously using two-hybrid methods, 
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coimmunoprecipitations or affinity chromatography-based techniques; 18 of these 
were confirmed in our Y2H assays (Fig 2a, Table 2). Failure to detect interactions 
may result from the high stringency of our particular two-hybrid system. However, in 
most cases the occurrence of false negatives can be explained by the lack of 
essential domains in one of the protein fragments used. For example, an interaction 
between p53 and hADA3 has been described 32 , with the first 214 amino acids of 
hADA3 being essential for this interaction. It escaped our two-hybrid analysis, 
because a C-terminal hADA3 fragment (amino acids 235^*32) was used. For the 
same reason, an interaction between p53 and BARD1 or between KPNA2 and 
KPNB1 was not observed. 

Beside false negatives, the two-hybrid assay is also prone to create false positive 
results 9 . Addressing this issue we performed a series of pull-down and overlay 
assays and thereby confirmed several of the two-hybrid PPIs independently. Proteins 
were expressed as GST-fusions in E. coli and as HA-fusions in COS1 cells. After 
immobilization of the GST-fusion protein to beads or nitrocellulose membranes the 
respective partner was affinity-purified from a COS1 cell extract and binding was 
detected by immunoblotting. Using these assays, 22 physical interactions, central to 
the HD network, were verified (Fig. 2a). The results of some in vitro GST pull-down 
assays are shown in Fig. 3. For example HD510Q17 interacts with HIP1, GIT1, 
PIASy, FEZ1 and HIP11, and HIP5 binds to HD510Q68, GIT1, HB01, PLIP and 
FEZ1 (Fig. 3). In total, 35 two-hybrid interactions were verified independently either in 
previous studies or by our in vitro binding assays (Fig. 2a). 

Example 4: GIT1 promotes htt aggregation in vivo 

The formation of insoluble polyQ-containing protein aggregates is a pathological 
hallmark of HD. Several lines of evidence link htt aggregation to disease progression 
and the development of motor symptoms. We screened network proteins for their 
potential to enhance htt aggregation in a cell-based aggregation assay 14 . In this 
assay, formation of SDS-insoluble htt aggregates in mammalian cells, that have been 
cotransfected with constructs encoding an N-terminal htt fragment with 68 glutamines 
(HD169Q68) and a network protein of interest, is monitored by filter retardation 14 . 
HD169Q68 perse has only a low propensity to form insoluble aggregates in HEK293 
cells. However, as shown in Fig. 4a coexpression of the htt-interacting protein GIT1 
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strongly promotes the formation of HD169Q68 aggregates, whereas coexpression of 
PIASy, HIP5, HP28, PFN2, FEZ1 and BARD1 has no discemable effect. Thus, GIT1 
is a potential modifier of HD pathogenesis, which may influence the rate of formation 
of insoluble htt aggregates in vivo. 

Furthermore, probing of the insoluble HD169Q68 aggregates with an anti-GIT1 
antibody revealed that GIT1 does not only stimulate aggregation but is also an 
integral part of the insoluble aggregates (Fig. 4a). This suggests that GIT1 promotes 
aggregation through direct binding to mutant htt. 

The interaction between GIT1 and htt was confirmed by coimmunoprecipitation from 
i COS1 cells transfected with constructs encoding HD510Q68 and HA-GIT1. Forty 
hours post transfection cell extracts were prepared and treated with antiserum 
against GIT1. HD510Q68 and HA-GIT1 were detected in the immunoprecipitate on 
Western blots with anti-htt antibody 4C8 and anti-HA antibody 12CA5, respectively 
(Fig. 4b). 

The GIT1-htt interaction was also detected in human brain. Protein extracts prepared 
from human cortex were treated with the anti-htt antibodies CAG53b and HD1, and 
the precipitate was probed for the presence of GIT1 (Fig. 4c). Full length GIT1, 
migrating at about 90 kDa 33 , was precipitated by both anti-htt antibodies in a 
concentration dependent manner, indicating the existence of a complex between htt 
and GIT1 in neurons. 

Finally, we performed colocalisation studies of htt and GIT1 in COS1 cells using 
immunofluorescence microscopy. In cells expressing HD510Q68 or GIT1 alone a 
diffuse cytoplasmic staining was observed for each protein (Fig. 4d). However, when 
GIT1 and mutant htt were coexpressed, large perinuclear structures, most likely 
reflecting protein aggregates, appeared almost exclusively. These structures 
contained both GIT1 and htt. The images further substantiate the findings that GIT1 
and htt bind to each other and that GIT1 is a potent enhancer of mutant htt 
aggregation. 

Example 5: GIT1 localises to htt aggregates in HD transgenic mouse and 
patient brains 
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The finding of colocalisation of htt and GIT1 within aggregates in transfected COS1 
cells suggests that GIT1 might also be a component of htt aggregates in vivo. To 
investigate this possibility we first assessed the distribution of GIT1 in brains of R6/1 
transgenic mice expressing a human htt exon 1 protein with 150 glutamines 34 . In 
wildtype mice, GIT1 immunoreaction product was found diffuse in the cytoplasm and 
nuclei of neurons throughout the brain. In R6/1 brains, in addition to the diffuse 
staining, GIT1 immunoreactivity was also present in large nuclear and cytoplasmic 
puncta similar to htt aggregates (Figure 5a). To further confirm these data, we 
examined the subcellular distribution of GIT1 in cortex from HD patient brains and 
healthy individuals (Fig. 5b). In patient brains, GIT1 antibodies labelled neuronal 
nuclear inclusions as well as neuropil aggregates characteristic of HD brains 35 . In 
contrast, neurons from control brains only showed a diffuse nuclear and cytoplasmic 
GIT1 immunostaining. In fact, in colocalisation studies performed in HD brain 
sections, GIT1 positive aggregates were also labelled with anti-htt antibody 2B4, 
indicating that both proteins coaggregated in vivo (Fig. 5c). This observation raises 
the possibility that an alteration of the neuronal GIT1 subcellular distribution 
contributes to HD pathogenesis. 

PART II: VERIFICATION AND FURTHER RESULTS 

Example 6: Experimental Procedures 

• Antibodies 

A polyclonal antibody (pAb) against GIT1 was generated by injection of purified His6- 
tagged GIT1 (aa 368-587) into a rabbit. The resulting GIT1 pAb (C-GIT1) was affinity 
purified using immobilized GIT1 protein. The pAb NT-GIT1 recognizes the first 100 
aa of GIT1 (Santa Cruz Biotechnology), the monoclonal antibody (mAb) CT-GIT1 
(Transduction Laboratories) is specific for the last 10 6 amino acids of GIT1. For all 
three Abs, no cross-reaction with GIT2 was observed (Fig. 13). The pAbs against 
GAPDH (Wanker et al., 1997) and htt [CAG53b (Davies et al., 1997) and HD1 
(Scherzinger et al., 1997)] were described. Commercially available antibodies were 
anti-GST pAb (Amersham Pharmacia), anti-HA mAb 12CA5 (Roche Diagnostics), 
anti-htt pAb EM48 (Gutekunst et al., 1999), anti-htt mAb 2B4 (Lunkes et al., 2002), 
anti-htt mAb 4C8 (Chemicon) and anti-EEA1 pAb (Santa Cruz Biotechnology). As 
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secondary antibodies for immunofluorescence microscopy, Cy3- (dianova) and Alexa 
488- (MoBiTec) conjugated IgGs were used. 



e Strains and plasmids 

The yeast strains used for two-hybrid analysis were L40ccua [MATa his3D200 trpl- 
901 Ieu2-3,112 LYS2::(lexAop)4-HIS3 ura3::(lexAop)8-lacZ ADE2::(lexAop)8- URA3 
GAL4 gal80 can1 cyh2] and L40cca [MATa his3D200 trp1-910 Ieu2-3,112 ade2 
LYS2::(lexAop)4-HIS3 URA3::(lexAop)8-lacZ GAL4 gal80 can1 cyh2]. 

Plasmids pHD510Q17 and pHD510Q68 were generated by insertion of fragments 
coding for HD510Q17 and HD510Q68 into pcDNA-l (Invitrogen). pHD169Q68 was 
derived from pHD510Q68 by deletion of the Xhol-Xhol fragment encoding aa 170- 
510 of human htt. pV5-HD169Q68 was generated by inserting the EcoRI-Xhol 
fragment from pHD510Q68 into pcDNA3.1A/5-HIS (Invitrogen). Full-length GIT1 (aa 
1-770) was amplified by PCR from the cDNA clone IMAGp958H111245Q2 (RZPD, 
Germany) using the primers GIT1-F/GIT1-R and subcloned into the EcoRI-Bglll site 
of pTL-HA (HA-GIT1). The GIT2 cDNA (aa 1-759) was PCR amplified with the 
primers GIT2-F / GIT2-R and subcloned into the Xhol-Notl site of pTL-HA (HA-GIT2). 
The primer sequences were as follows: GIT1-F (5'- 
CGGAATTCATGTCCCGAAAGGGGCCGCG-3'), GIT1-R (5 '-GGAAG ATCT 
GGTCACTGCTTCTTCTCTCGGG-3 '), GIT2-F (5'-ACGCGTCGACCATGTCGAAA 
CGGCTCCG-3') and GIT2-R (5'ATAAGAATGCGGCCGCGCCCTGCCCTTGCTA 
GTTG -3'). 



• Library screening 

Plasmids encoding baits were transformed into L40ccua, tested for the absence of 
reporter gene activity and jotransformed with a human fetal brain cDNA library 
(Clontech). For each transformation, 1 x 10 6 independent transformants were plated 
onto minimal medium lacking tryptophan, leucine, histidine and uracil (SDIV medium) 
and incubated at 30°C for 5 to 10 days. Clones were picked into microtitre plates and 
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grown overnight in liquid minimal medium lacking tryptophan and leucine (SDN 
medium). Then, they were spotted onto nylon membranes placed on SDIV agar 
plates. After incubation for 4 days, the membranes were subjected to a b- 
galactosidase (b-GAL) assay. Plasmids were prepared from positive clones and 
characterized by sequencing. For retransformation assays, plasmids encoding baits 
and preys were cotransformed into L40ccua and plated onto SDIV medium. 



• Array mating screen 

Plasmids encoding baits and preys were transformed into strains L40ccua and 
L40cca, respectively. L40cca clones were arrayed in 96-well microtitre plates and 
mixed with a single L40ccua clone for interaction mating. Diploid cells were 
transferred onto YPD medium plates and, after incubation for 24 h at 30°C, onto SDH 
medium plates for additional 72 h at 30°C. For two-hybrid selection, diploid cells were 
transferred onto SDIV medium plates with and without nylon membranes and 
incubated for 5 days at 30°C. The nylon membranes were subjected to the b-GAL 
assay. Positive clones were verified by cotransformation assays. 



• Protein expression and verification assays 

For verification experiments, cDNA fragments encoding baits and preys were 
subcloned into pGEX derivatives (Stratagene) or pTL-HA (Sittler et al., 1998). GST- 
fusion proteins were expressed in E. coli BL21-codon PlusTM RP (Stratagene) and 
affinity purified on glutathione agarose beads (Sigma) (Wanker et al., 1997). COS-1 
cells were transfected with mammalian expression plasmids and lysed as described 
(Sittler et al., 1998). For in vitro binding assays, 30 pg of GST or GST fusion protein 
were immobilized on glutathione agarose beads and incubated with 500 pg COS-1 
cell extract containing HA-tagged fusion protein for 2 h at 4°C, in binding buffer [50 
mM HEPES-KOH pH 7.4, 150 mM NaCI, 10% glycerol, 1% NP-40, 1 mM EDTA, 20 
mM NaF, 1 mM DTT, 0.1% Triton X-1 00, protease inhibitors (Roche Diagnostics)]. 
After centrifugation and extensive washing, bound proteins were eluted and analyzed 
by SDS-PAGE and Western blotting. 
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Coimmunoprecipitation experiments were performed as previously described (Sittler 
et al., 1998). For immunofluorescence microscopy, COS-1 cells were grown on cover 
slips and cotransfected with plasmids encoding N-terminal htt V5-HD169Q68 and/or 
C-terminal HA-GIT1-CT. 40 h post-transfection, cells were treated with 2% 
paraformaldehyde. Immunolabeling was performed with anti-C-GIT1 (1:500) and with 
anti-V5 (1:300) Abs. Nuclei were counterstained with Hoechst. For subcellular 
localization of endogenous GIT1 and htt, differentiated PC12 and SH-SY5Y cells 
were used. PC12 cells were treated with 50 ng/ml NGF and grown on cover slips for 
6 d. SH-SY5Y cells were serum starved for 24 h and then differentiated with 10 nM 
IGF-I for 30 min. Cells were labeled with C-GIT1 (1:20) and 4C8 (1:20) Abs and 
viewed with a confocal microscope LSM510 (Zeiss). 



• Filter Retardation Assay 

HEK293 cells coexpressing HD169Q68 and selected network proteins were 
harvested 48 h post-transfection. Cell lysates were boiled in 2% SDS, 50 mM DTT for 
5 min. Aliquots containing 12.5, 25 or 50 ug of total protein were used for filtration on 
cellulose acetate membranes (Scherzinger et al., 1997). SDS-resistant aggregates 
were detected using anti-CAG53b or anti C-GIT1 pAbs. 



• Inhibition of GIT1 expression by siRNA 

For silencing of endogenous GIT1 expression, HEK293 cells were transfected with 
the siRNA duplex siRNA-GIT1 (5'-AAGCCTGGATGGAGACCTA GA-3') using 
TransMessenger (Qiagen) or Lipofectamin 2000 (Invitrogen) transfection reagents. 
48 h post transfection, cell lysates were analyzed for GIT1 expression by 
immunoblotting using C-GIT1 Ab. To examine the effect of endogenous GIT1 
silencing on htt aggregation, HEK293 cells were cotransfected with pHD169Q68 and 
siRNA-GIT1 and subjected to filtration after 72 h. 



• Detection of GIT1 in R6/1 mouse and human HD brains 
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For immunocytochemistry, mice were deeply anaesthetized and perfused through the 
left cardiac ventricle with 4% paraformaldehyde in 0.1 M phosphate buffer. Brains 
were removed and postfixed overnight in 4% paraformaldehyde. Sections were 
processed for immunocytochemistry as described (Gutekunst et al., 1999). EM48 
(1:1000) and C-GIT1 (1:100) pAbs were used. 

Tissues from 8 human HD and 7 control brains were used in this study. Two HD 
cases were, classified as grade 3 of neuropathologies severity, six cases as grade 4. 
Standard protocols were used (Lunkes et al., 2002) for immunolabeling with 2B4 
mAb (1:200) and C-GIT1 pAb (1:50). For Western analysis of total protein lysates 
from frontal cortex, the C-GIT1 pAb (1:300) was used. 



Example 7:Two-hybrid screens 

To generate a PPI network for HD, we used a combination of library and matrix yeast 
two-hybrid screens (Fig. 7A). Previous studies have shown that htt potentially 
participates in clathrin-mediated endocytosis, apoptosis, vesicle transport, cell 
signaling, morphogenesis and transcriptional regulation (Harjes and Wanker, 2003; Li 
and Li, 2004). For this reason, we selected 50 cDNAs encoding proteins involved in 
these processes, including 5 different N-terminal htt fragments, as well as proteins 
known to interact with htt, for subcioning into a DNA binding domain vector to 
express LexA fusion proteins as baits (Suppl. Table 1). The resulting plasmids were 
sequenced and introduced into yeast strain L40ccua, which carries three reporter 
genes, HIS3, URA3 and lacZ, for two-hybrid interaction analysis. 

Forty of these baits did not activate the reporters by themselves and were used 
individually for cotransformation screening of a human fetal brain cDNA library 
expressing GAL4 activation domain (AD) hybrids as preys. In each screen, 1 x 10 6 
auxotrophic transformants were tested on selective plates, and 1-50 positive colonies 
were typically obtained. Restriction analysis and sequencing revealed that about 12% 
of all positive clones expressed preys with correct in-frame sequences, while 88% of 
the clones contained plasmids with out-of-frame sequences or sequences from non- 
protein-encoding regions, which were discarded. 27 preys were identified only once, 
while the other 1 1 were found up to four times as independent AD fusions. Plasmids 
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with the longest coding regions were used for subsequent studies. The preys 
identified by the library two-hybrid screens were tested together with their respective 
baits for activation of reporter gene expression in cotransformation assays. Only 
prey/bait combinations that activated the reporter gene expression in two 
independent cotransformation assays were selected for further two-hybrid studies 
and in vitro pull-down assays (Fig. 9). Starting with 40 baits in the library and 
subsequent cotransformation screens, we identified 41 PPIs among 18 bait and 38 
prey proteins. 

For a second round of two-hybrid screens, cDNAs encoding 12 prey proteins were 
selected from literature verified interactions and from interactions confirmed by in 
vitro binding experiments (Suppl. Table 2), and subcloned into a DNA binding domain 
vector. The resulting baits were tested for autoactivation, and 10 were screened 
against a human fetal brain cDNA library as described above. We identified another 
14 PPIs among 5 bait and 13 prey proteins. Nine preys were found once and 4 were 
discovered multiple times as independent AD fusions. All interactions were confirmed 
by cotransformation assays. 

Finally, an array-mating screen was performed to connect bait and prey proteins 
identified in the cDNA library transformation screens (Fig. 7A). L40cca yeast cultures 
were transformed with plasmids encoding the 51 prey proteins obtained in the first 
and second round of cDNA library screens and arrayed in 96-well microtiter plates. 
Pre^cDNAs were also subcloned into DNA binding domain vectors and introduced 
into an L40ccua strain to generate additional baits for interaction mating. Including 
the ones already used for the library screens, we arrived at 46 baits, which did not 
autoactivate the reporter genes (Table 7). These baits were used individually for 
mating against the matrix of prey proteins. Diploid yeast clones, formed on YPD 
plates, were selected on agar SDN plates, and further transferred by a spotting robot 
onto SDIV plates to select for Y2H interactions (Fig. 7B). We examined 2346 (51x46) 
pair wise combinations of baits and preys in the mating assay reproducing all 55 two- 
hybrid interactions, which had been found in the library screens. In addition, 131 new 
PPIs were found by interaction mating and subsequently reproduced in 
cotransformation assays. Using this combination of library and matrix two-hybrid 
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screens, a total of 186 PPIs among 35 bait and 51 prey proteins could be identified 
(Fig. 8A); 

Sequence analysis of the cDNAs revealed ORFs ranging from 82 to 728 amino acids 
in size (Table 7). In a systematic Blast search, 77 of the 86 bait and prey protein 
fragments were identical to a SwissProt or TrEMBL protein entry 
(http://us.expasy.org/sprot/). Nine proteins showed 75-99% identity to their best 
respective database hit and either contained single amino acid substitutions, variable 
polyQ lengths or small regions of sequence variation. Uncharacterized proteins were 
named according to their interaction partners. 

This chapter describes the whole yeast two hybrid screening procedure and obtained 
fundamental data. A full description of our final datasets are shown in tables 6 to 9. 
Table 6 contains a compilation of all found protein-protein interactions in the 
Huntington's disease protein network. Some of these interactions are already known 
and literature-cited. A dataset which describes only new identified interactions will be 
found in Table 9. Table 7 characterizes all proteins involved in the protein network. 
Most of these proteins are known from different databases but some proteins are still 
unknown (Table 8). Nucleic acid and amino acid sequence data for all network- 
proteins are available from Figure 6. 

Example 8: Functional assignment of yeast two-hybrid data 

To chart two-hybrid interactions identified in this study, previously reported, or 
verified by independent methods, a matrix representation was used (Fig. 8A). We 
assigned a subcellular localization to each network protein by examining various 
sources of literature and, based on the experimental data, we grouped the proteins 
into six broad functional categories (Fig. 8A, Table 7). 18 proteins in the HD network 
are involved in transcriptional regulation or DNA maintenance; 14 proteins mainly 
participate in cytoskeletal and transport processes. We assigned 7 proteins to cellular 
signaling and fate, another 5 to protein synthesis and turnover, and 3 proteins to 
cellular metabolism. 16 proteins of unknown function were identified, participating in 
72 interactions. The number of interactions identified for each protein varied from 1 to 
24, with 2.6 interaction partners on average. Interestingly, proteins such as htt, 
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BARD1, GADD45G, HIP5, HZFH, PIASy, BAIP3 or VIM interact with more than 15 
other proteins, forming hubs in the HD network, while 15 proteins have only one 
interaction partner. 

For htt, 19 different interacting partners from various functional groups were 
identified, of which HIP1, CA150, SH3GL3 and HYPA had been described previously 
(Harjes and Wanker, 2003). 6 of the htt partners are involved in transcriptional 
regulation and DNA maintenance, 4 function in cellular organization and transport 
and 3 in cellular signaling, supporting the hypothesis that htt is involved in these 
processes. Moreover, we have detected 6 novel htt interacting proteins of unknown 
function termed HIP5, HIP11, HIP13, HIP15, HIP16, and CGI-125. 

Using 5 different N-terminal htt fragments as baits, the potential htt-binding sites of 
13 interaction partners were mapped (Fig. 8A). For the proteins CA150, HYPA, 
PNF2, SH3GL3, CGI-125 and HIP13, however, a conclusive determination of the htt 
binding region was not possible with the two-hybrid assay, because these proteins 
bound to HDexQ20, HDexQ51 and HD1.7, but not to HDdl.O (Fig. 8A). We suggest 
that these proteins bind to the htt exon 1 fragment, but this binding region might be 
masked in the HDdl.O protein, while it is accessible in the HD1.7 fragment. 
Interestingly, we found that HP28 and HIP15 bind to HDexQ51, but not to HDexQ20, 
HD1.7 and HD1.0, indicating that the interaction of these proteins with htt is 
• enhanced by the expanded polyQ repeat. Thus, HP28 and HIP15 may be disease 
specific htt interactors. 

To generate a more comprehensive HD interaction map, we supplemented our two- 
hybrid network (red diamonds) with all 38 known direct htt interaction partners 
(Suppl. Table 4 and Fig. 8B, blue squares). Furthermore, we added 83 human 
proteins (green triangles), identified from protein interaction databases HPRD, MINT, 
and BIND that bridge any two proteins in our extended network. Using this approach, 
we obtained an interaction network for htt containing a total of 181 proteins and 591 
PPIs (Fig. 8B and Suppl. Table 5). 



Example 9: Verification of PPIs 
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Comparison with literature-cited interactions revealed that more than 89% of the two- 
hybrid interactions identified are unknown. 30 PPIs have been reported previously 
using two-hybrid methods, coimmunoprecipitations or affinity chromatography-based 
techniques; 21 of these were detected in our Y2H assays (Fig. 8A, Suppl. Table 3). 
In most cases, the occurrence of false negatives can be explained by the lack of 
essential domains in one of the protein fragments. For example, an interaction 
between p53 and hADA3 has been described (Wang et al., 2001), with the first 214 
amino acids of hADA3 being essential for this interaction. It escaped our two-hybrid 
analysis, because a C-terminal hADA3 fragment (amino acids 235-432) was used. 

Failure to detect interactions may also result from the high stringency of our two- 
hybrid assay, which can be attributed to low protein expression levels and the 
simultaneous use of three reporters. Our system is particularly designed to minimize 
false positives, which are known to occur frequently in two-hybrid assays (von Mering 
et al., 2002). To determine the rate of false positives in our system, we directly 
assessed 54 interactions from the two-hybrid network by in vitro pull-down 
experiments, mainly focusing on htt and its immediate interaction partners. Proteins 
were expressed as GST-fusions in E. coli, and their interacting partners as HA- 
fusions in COS-1 cells. After immobilization of GST-fusion proteins to beads, the 
potential interaction partners were pulled down from COS-1 cell extracts. Binding 
was detected by SDS-PAGE and immunoblotting. Using this assay, 35 interactions 
representing 32 different protein pairs were verified successfully (Fig. 9). Failure to 
detect an interaction by GST pull-down assays could be due to low protein 
expression levels or the lack of appropriate protein modifications. Therefore, the 19 
non-verified protein-protein interactions are still valid until further experiments show 
contradictory results. The rate of 64.8% verified interactions suggests that in our Y2H 
network false positives might appear less frequently than described for other PPI 
studies (von Mering et al., 2002). 



Example 10: GIT1 promotes htt aggregation 

Several lines of evidence indicate that aggregation of mutant htt is linked to disease 
progression and the development of motor symptoms (Davies et al., 1997; Sanchez 
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et al., 2003). Therefore, cellular proteins that influence aggregate formation are 
potential modulators of disease pathogenesis. In order' to identify such proteins, we 
screened all 19 direct htt interaction partners (Fig. 8A) for their ability to enhance htt 
aggregation in a cell-based assay (Sittler et al., 1998). In this assay, HEK293 cells 
were cotransfected with constructs encoding an aggregation prone N-terminal htt 
fragment with 68 glutamines (HD169Q68) and a network protein. After 48 h, 
formation of SDS-insoluble htt aggregates was analyzed by a filter retardation assay 
(Scherzinger et al., 1997). In this time period HD169Q68 by itself formed only few 
aggregates. In comparison, coexpression of the C-terminal GIT1 fragment found in 
the Y2H screens (GIT1-CT) increased the amount of htt aggregates 3-foid (Fig. 10A). 
Coexpression of HD169Q68 with other htt-interacting proteins, on the other hand, did 
not enhance htt aggregation (data not shown). 

It has been described previously that GIT1 and its homologue p95-APP1 are able to 
form homo- and heterodimers in vitro and in vivo (Kim et al., 2003; Paris et al., 2003). 
Therefore, we wondered whether GIT1-CT by itself is able to form SDS-insoluble 
protein aggregates in mammalian cells. As shown in Fig. 10A, we did not detect 
aggregates in the filter retardation assay upon transient overexpression of GIT1-CT. 
However, in cells coexpressing HD169Q68 and GIT1-CT, stable SDS-resistant 
aggregates immunoreactive with the anti-C-GIT1 antibody were formed, indicating 
that both proteins coaggregate in cells, and that GIT1-CT is an integral part of the 
insoluble htt aggregates (Fig. 1 0A). 

Next, we tested whether full-length GIT1 is able to accelerate htt aggregation in 
mammalian cells. Analysis by filter retardation assay revealed that full-length GIT1 
enhances htt aggregation in a dose dependent manner (Fig. 10B). However, 
compared to GIT1-CT, it was less efficient in stimulating HD169Q68 aggregation in 
the cell model, indicating that the N-terminally truncated GIT1 fragment is a more 
potent enhancer of htt aggregation than the full-length protein. 

As previous studies have shown that the expression of C-terminal GIT1/p95-APP1 
fragments induces the formation of large vesicular structures in mammai^n cells (Di 
Cesare et al., 2000; Matafora et al., 2001), we analyzed the effect of GIT1-CT on 
HD169Q68 aggregation by indirect immunofluorescence microscopy. We found that 
expression of GIT1-CT alone induced the accumulation of large vesicular structures 
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in the perinuclear region (Fig. 10Cb). In comparison, when HD169Q68 was 
expressed alone, the protein was distributed in the cytoplasm, and no large 
aggregates or inclusion bodies were observed (Fig. 10Ca). However, when 
HD169Q68 and GIT1-CT were coexpressed (Fig. 10Cd-f), htt was almost exclusively 
detected in the perinuclear vesicles (Fig. 10Cd), indicating that GIT1-CT 
overexpression induces the relocalization of htt to membranous structures. A similar 
effect was observed when full-length GIT1 and HD169Q68 were coexpressed in 
COS1 cells, however, the rate of vesicle formation and htt recruitment was lower, 
compared to GIT1-CT/HD169Q68 expressing cells (data not shown). The 
colocalization of GIT1 with the early endosomal marker EEA1 is shown in Fig. 10Cc. 
Together, these results suggest that the enhancement of HD169Q68 aggregation in 
mammalian cells is due to the recruitment of mutant htt into vesicular structures 
induced by overexpression of GIT1 or GIT1-CT. 



Example 11 : GIT1 is crucial for the formation of htt aggregates in mammalian 
cells 

Next, we investigated whether endogenous GIT1 promotes htt aggregation in 
mammalian cells. In order to reduce endogenous GIT1 levels in HEK293 cells, we 
employed the short-interfering RNA (siRNA) technology (Elbashir et al., 2001). Cells 
were cotransfected with HD169Q68 and GIT1-specific siRNA, . and silencing of 
endogenous GIT1 was monitored 48 h post transfection by Western blot analysis 
(Fig. 10D). We found that siRNA treatment specifically reduced endogenous GIT1 by 
-80% and caused a strong decrease of HD169Q68 aggregate formation (Fig. 10E). 
After incubation for 72 h, SDS-resistant HD169Q68 aggregates were detected in 
untreated, but not in siRNA treated cells. This indicates that physiological levels of 
GIT1 are critical for htt aggregation in mammalian cells, and that an inhibition of GIT1 
expression dramatically slows down aggregate formation. A similar effect was also 
obtained when GIT1-specific siRNA was applied to cells overexpressing GIT1-CT 
and HD 1 69Q68 proteins (data not shown). 



Example 12: Verification of the htt-GIT1 interaction 
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The interaction between GIT1-CT and htt was confirmed by coimmunoprecipitation 
from COS-1 cells transfected with constructs encoding the first 510 amino acids of htt 
with 68 glutamines (HD510Q68) and an N-terminally truncated hemagglutinin (HA) 
tagged HA-GIT1-CT (aa 249-770) protein. 40 h post-transfection, cell extracts were 
prepared and treated with GIT1 antiserum. HD510Q68 and HA-GIT1-CT were 
detected in the immunoprecipitates on Western blots with anti-htt antibody 4C8 and 
anti-HA antibody 12CA5, respectively (Fig. 11 A). 

The GIT1-htt interaction was also detected in healthy human brain. Protein extracts 
prepared from cortex were treated with the anti-htt antibodies CAG53b and HD1, and 
the precipitate was probed for the presence of GIT1 (Fig. 11B) with a GIT1 specific 
antibody (NT-GIT1; Fig. 13). Full length GIT1, migrating at about 95 kDa (Vitale et al., 
2000), was precipitated by both anti-htt antibodies in a concentration dependent 
manner, indicating that a protein complex containing htt and GIT1 is formed under 
physiological conditions. 

Next, we examined the colocalization of endogenous htt and GIT1 in differentiated 
PC12 cells by confocal immunofluorescence microscopy. Both proteins were mainly 
detected in the cytoplasm, but were also present in the neurite-like extensions (Fig. 
11 Cab). Colocalization, indicated in yellow, was visible in cytoplasmic complexes in 
the perinuclear region (Fig. 11Cc) as well as in a number of intracellular structures 
scattered throughout the neuritic extensions. GIT1 was also detected in adhesion-like 
structures at the tip of the extensions, as previously reported (Di Cesare et al., 2000; 
Manabe Ri et al., 2002). These regions, however, did not contain htt protein. Similar 
results were obtained when the endogenous localization of GIT1 and htt was 
analyzed in differentiated neuroblastoma SH-SY5Y cells using confocal 
immunofluorescence microscopy (Fig. 11Cd-f). 



Example 13: GIT1 localizes to htt aggregates in patient brain 

Our findings suggest that GIT1 might also be a component of neuronal inclusions 
containing htt aggregates in brain of HD patients and transgenic animals (Davies et 
al., 1997; DiFiglia et al., 1997). To investigate this possibility, we first assessed the 
distribution of GIT1 in brain slices of R6/1 transgenic mice expressing a human htt 
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exon 1 protein with 150 glutamines (Mangiarini et al., 1996). In wild type mice, GIT1 
specific immune-reactivity was diffused in the cytoplasm and nuclei of neurons 
throughout the brain. In R6/1 brain, however, in addition to a diffuse staining, GIT1 
immunoreactivity was also present in large nuclear and cytoplasmic puncta 
containing htt aggregates (Fig. 12A). To further confirm these data, we examined the 
subcellular distribution of GIT1 in HD patient and healthy cortex (Fig. 12B). In patient 
brain, GIT1 specific antibodies labeled neuronal nuclear inclusions as well as the 
neuropil aggregates characteristic of HD (DiFiglia et al., 1997). In contrast, neurons 
from control tissue showed only diffuse nuclear and cytoplasmic GIT1 
immunostaining. Fig. 12C shows colocalization of htt and GIT1 in neuronal nuclear 
inclusions. 



Example 14: GIT1 is degraded in HD patient brain 

The presence of GIT1 in protein extracts from HD affected and unaffected cortex was 
also analyzed by SDS-PAGE and immunoblotting. As shown in Fig. 12D, full-length 
GIT1 protein migrating at about 95 kDa was detected in healthy brain (Fig. 12D), but 
was significantly reduced in HD. Interestingly, in HD, but not in control brain, 
prominent GIT1 degradation products migrating at about 25-50 kDa were detected 
with the C-terminal GIT1 antibody C-GIT1 (Fig. 12D). In strong contrast, no such 
products were observed when the N-terminal GIT1 antibody NT-GIT1 directed 
against the ARF-GAP domain was used (data not shown). This indicates the 
formation of large amounts of N-terminally truncated GIT1 fragments in HD brain, 
which may be a significant factor in disease pathogenesis. 
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Claims 

1 . A method for generating a network of direct and indirect interaction partners of a 
disease-related (poly)peptide comprising the steps of 

(a) contacting a selection of (poly)peptides suspected to contain one or several 
of said direct or indirect interaction partners with said disease-related 

. (poly)peptides and optionally with known direct or indirect interaction 
partners of said disease-related (poly)peptide under conditions that allow 
the interaction between interaction partners to occur, 

(b) detecting (polypeptides that interact with said disease-related 
(poly)peptide or with said known direct or indirect interaction partners of 
said disease-related (poly)peptide; 

(c) contacting (poly)peptides detected in step (b) with a selection of 
(poly)peptides suspected to contain one or several (polypeptides 
interacting with said (poly)peptides detected in step (b) under conditions 
that allow the interaction between interaction partners to occur; 

(d) detecting proteins that interact with said (poly)peptides detected in step (b); 

(e) contacting said disease-related (poly)peptide and optionally said known 
direct or indirect interaction partners of said disease-related (polypeptide, 
said (poly)peptides detected in steps (b) and (d) and a selection of proteins 
suspected to contain one or several (poly)peptides interacting with any of 
the afore mentioned (poly)peptides under conditions that allow the 
interaction between interaction partners to occur; 

(f) detecting (poly)peptides that interact with said disease-related 
(poly)peptide and optionally said known direct or indirect interaction 
partners of said disease-related (poly)peptide or with said (poly)peptides 
identified in step (b) or (d); and 

(g) generating a (poly)peptide-(poly)peptide interaction network of said 
disease-related (poly)peptide and optionally said known direct or indirect 
interaction partners of said disease-related (poly)peptide and said 
(poly)peptides identified in steps (b), (d) and (f). 

2. The method of claim 1, wherein said contacting step (e) is effected in an 
interaction mating two hybrid approach. 
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3. The method of claim 1 or 2, said method comprising after step (d) and before step 
(e) the steps of: 

(d') contacting (poly)peptides detected in step (d) with a selection of 
(polypeptides suspected to contain one or several (polypeptides interacting 
with said (polypeptides detected in step (d) under conditions that allow the 
interaction between interaction partners to occur; and 

(d n ) detecting proteins that interact with said (polypeptides detected in step (d'). 

4. The method of any one of claims 1 to 3, wherein said disease-related protein is a 
protein suspected of being a causative agent of a hereditary disease. 

5. The method of any one of claims 1 to 4, wherein said disease-related protein is 
huntingtin and wherein said interaction partners are the interaction partners as 
shown in tables 6, 7 or 9. 

6. The method of any one of claims 1 to 5, said method comprising the step of 
determining the nucleotide sequence of a nucleic acid molecule encoding a direct 
or indirect interaction partner of the disease related protein. 

7. The method of any one of claims 1 to 6, wherein said selections of proteins are 
translated from a nucleic acid library. 

8. The method of any one of claims 1 to 7, wherein said selection of proteins in step 
(a) and/or (c) and/or (d') and/or (e) is the same selection or a selection from the 
same source. 

9. The method of any one of claims 1 to 7, wherein said selection of proteins in step 
(a) and/or (c) and/or (d') and/or (e) is a different selection or a selection from a 
different source. 

10. The method of any one of claims 1 to 9, wherein said method is performed by 
contacting the proteins on an array. 
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11. The method of any one of claims 1 to 10, wherein said interactions are detected 
by using the yeast two-hybrid system. 

12. The method of any one of claims 1 to 11, containing after step (b), (d), (d") or (f) 
the additional steps of isolating a nucleic acid molecule with homology to said 
cDNA expressing the encoded protein and testing it for its activity as a modulator 
of huntingtin, wherein said nucleic acid molecule is DNA, or RNA, preferably 
cDNA, or genomic or synthetic DNA or mRNA. 

13. A nucleic acid molecule encoding a modulator of huntingtin, wherein said 
modulator is a protein selected from table 8. 

U.The nucleic acid molecule of claim 13, wherein said nucleic acid molecule is DNA, 
preferably cDNA, genomic DNA, or synthetic DNA or RNA, preferably mRNA. 

15. The nucleic acid molecule of claim 13 or 14 fused to a heterologous nucleic acid 
molecule. 

16. The nucleic acid molecule of claim 15, wherein the heterologous nucleic acid 
molecule encodes a heterologous (polypeptide. 

17. A vector comprising the nucleic acid molecule of any one of claims 13 to 16. 

18. A host cell containing the nucleic acid molecule of any one of claims 13 to 16 or 
the vector of claim 17. 

19. A method of producing a (poly)peptide, comprising culturing the host cell of claim 
18 under conditions such that the (polypeptide encoded by said polynucleotide is 
expressed and recovering said (polypeptide. 

20. A (poly)peptide comprising an amino acid sequence encoded by a nucleic acid 
molecule of any one of claims 13 to 16, or which is chemically synthesized, or is 
obtainable from the host cell of claim 18, or which is obtainable by the method of 
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claim 19 or which is obtainable from an in vitro translation system by expressing 
the nucleic acid molecule of any one of claims 13 to 16 or the vector of claim 17. 

21. The (polypeptide of claim 20 fused to a heterologous (poly)peptide. 

22. A protein complex comprising at least two proteins, wherein said at least two 
proteins are selected from the group of interaction partners listed in table 9. 

23. An antibody specifically recognizing the (poly)peptide of claim 20 or 21 or 
specifically reacting with the protein complex of claim 22. 

24. The antibody of claim 23 which is polyclonal, monoclonal, chimeric, single chain, 
single chain Fv, human antibody, humanized antibody, or Fab fragment. 

25. A method of identifying whether a protein promotes huntingtin aggregation, 
comprising 

(a) transfecting a first cell with a nucleic acid molecule encoding a variant of the 
huntingtin protein or a fragment thereof capable of forming huntingtin 
aggregates; 

(b) co-transfecting a second cell with 

(i.) a nucleic acid molecule encoding a variant of the huntingtin protein 
or a fragment thereof capable of forming huntingtin aggregates; and 

(ii.) a nucleic acid molecule encoding a candidate modulator protein 
identified by the methods of any one of claims 1 to 12 or a nucleic 
acid molecule encoding a modulator protein selected from table 6 or 
table 7; 

(c) expressing the proteins encoded by the transfected nucleic acid molecule of 
(a) and (b); 

(d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) 
and (b); and 

(e) determining the amount of insoluble huntingtin aggregates from the 
transfected cell of (a) and (b) 

wherein an increased amount of huntingtin aggregates isolated from the 
transfected cells of (b) in comparison with the amount of huntingtin aggregates 
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isolated from the transfected cells of (a) is indicative of a protein's activity as an 
enhancer of huntingtin aggregation. 

26.A method of identifying whether a protein inhibits huntingtin aggregation, 
comprising 

(a) transfecting a first cell with a nucleic acid molecule encoding a variant of the 
huntingtin protein or a fragment thereof capable of forming huntingtin 

aggregates; 

(b) co-transfecting a second cell with 

(i.) a nucleic acid molecule encoding a variant of the huntingtin 
protein or a fragment thereof capable of forming huntingtin 
aggregates ; and 

(ii.) a nucleic acid molecule encoding a candidate modulator protein 
identified by the methods of any one of claims 1 to 12 or a 
nucleic acid molecule encoding a modulator protein selected 
from table 6 or table 7; 

(c) expressing the proteins encoded by the transfected nucleic acid molecule 
of (a) and (b); 

(d) isolating insoluble aggregates of huntingtin from the transfected cell of (a) 
and (b); and 

(e) determining the amount of insoluble huntingtin aggregates from the 
transfected cell of (a) and (b) 

wherein a reduced amount of huntingtin aggregates isolated from the 
transfected cells of (b) in comparison with the amount of huntingtin aggregates 
isolated from the transfected cells of (a) is indicative of a protein's activity as 
an inhibitor of huntingtin aggregation. 

27. The method of claim 25 or 26, wherein prior to step (d) the cells are treated 
with an ionic detergent. 

28. The method of any one of claims 25 to 27, wherein the huntingtin aggregates 
are filtered or transferred onto a membrane. 
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2y. A method for identifying compounds affecting an interaction of huntingtin or of 
a direct or indirect interaction partner of huntingtin comprising 

(a) contacting interacting proteins selected from the group of interacting 
proteins listed in table 6 and/or table 7 in the presence or absence of an 
potential modular of interaction; 

(b) identifying compounds capable of modulating said interaction. 

30. The method of any one of claims 25 to 29 , further comprising 

(a) modeling said compound by peptidomentics and 

(b) chemically synthesizing the modeled compound. 

31. The method of any one of claims 25 to 30, wherein said compound is further 
modified to achieve 

(i) modified site of action, spectrum of activity, organ specificity, and/or 

(ii) improved potency, and/or 

(iii) decreased toxicity (improved therapeutic index), and/or 

(iv) decreased side effects, and/or 

(v) modified onset of therapeutic action, duration of effect, and/or 

(vi) modified pharmakinetic parameters (resorption, distribution, metabolism 
and excretion), and/or 

(vii) modified physico-chemical parameters (solubility, hygroscopicity, color, 
taste, odor, stability, state), and/or 

(viii) improved general specificity, organ/tissue specificity, and/or 

(ix) optimized application form and route 
. b; 

(i) esterification of carboxyl groups, or 

(ii) esterification of hydroxy! groups with carbon acids, or 

(iii) esterification of hydroxyl groups to, e.g. phosphates, pyrophosphates or 
sulfates or hemi succinates, or 

(iv) formation of pharmaceutical^ acceptable salts, or 

(v) formation of pharmaceutical^ acceptable complexes, or 

(vi) synthesis of pharmacologically active polymers, or 

(vii) introduction of hydrophilic moieties, or 
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(viii) introduction/exchange of substituents on aromates or side chains, 
change of substituent pattern, or 

(ix) modification by introduction of isosteric or bioisosteric moieties, or 

(x) synthesis of homologous compounds, or 

(xi) introduction of branched side chains, or 

(xii) conversion of alkyl substituents to cyclic analogues, or 

(xiii) derivatisation of hydroxyl group to ketales, acetales, or 

(xiv) N-acetylation to amides, phenylcarbamates, or 

(xv) synthesis of Mannich bases, imines, or 

(xvi) transformation of ketones or aldehydes to Schiffs bases, oximes, 
acetales, ketales, enolesters, oxazolidines, thiozolidines or 
combinations thereof. 

32. A method of diagnosing Huntington's disease in a biological sample 
comprising the steps of 

(a) contacting the sample with an antibody specific for a protein of table 6 
or 7 or an antibody specific for the protein complex of claim 22; and 

(b) detecting binding of the antibody to a protein complex, 

wherein the detection of binding is indicative of Huntington's disease or of a 
predisposition to develop Huntington's disease. 

33. The method of claim 32, wherein 

(a) said protein complex contains GIT1 or 

(b) said antibody is specific for a protein complex containing GIT1. 

34. The method of claim 32, wherein 

(a) said protein complex contains at least one protein selected from htt, 
HIP15orHP28 

(b) said antibody is specific for a protein complex containing at least one 
protein selected from htt, HIP15 or HP28. 



35. A diagnostic agent/composition or pharmaceutical composition comprising the 
nucleic acid molecule of any one of claims 13 to 16, the (polypeptide of claim 
20 or 21 or the (poly)peptide mentioned in anyone of tables 6 and 7, the 
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antibody of claim 23 or 24, an antibody specifically reacting with a protein 
selected from table 7 and/or a protein selected from table 7. 

Use of the molecule of any one of claims 13 to 16, the (poly)peptide of claim 
20 or 21 or the (poly)peptide mentioned in anyone of tables 6 and 7, the 
antibody of claim 23 or 24, an antibody specifically reacting with a protein 
selected from table 7 and/or a protein selected from table 7, for the 
preparation of a pharmaceutical composition for the treatment of Huntington's 
disease. 
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>ALEX2 

AESWGAAMASAIAPPPGVTEALGAAEAF 
ATPGAHTGATPKATSATGAVPl^GGKGOT^ 

VI^LQKSPD PFIQQ V^ J&Wfi Iflf KIITlPHT Tf^ KAIMANjlgM^g- 

KIKVEILKILSNFAENPD^^ 

REFNKGSLFYLCTTSGVCV^ . 
>APPl'" 

EEEEESFPQPVDDYFVEPPQAEEEEETTOPPSSOT 
GFLRAKVIDLEERRMkQINEVM^^ 

THATRVIMjIJSTDQR^^ ■ 
KAQQMRFQVHTHLQVIEERVNQSLGLL^ 
.KGGLQPPDSKDDTPMTLPKGSTEQDAASPEKEK^ 
LGQGCPVRLCRVC 

>BAIP1 . . - 

RPRTKMATAMYLEHYLDS I E3STLPCELQRNFQLMRELDQRTEDKICAE ID I LA&E Y I S TVKTL'S PDQR 
VERLQKXQNAYSKCKEYSDDKVQLA^^ 

LKKGRGQKEKRGS RGRGRRT SEEDTPKKKKHKGGS EFTDT I LSVHP SDVLDMPVDPNEPTYCIjCHQ 

VS YGEMI G CDNPDCP I EWFHFACVDLTTKPKGKW 

>BAIP2 

SQC^SVTMHDTOAESFEVLVDYCTTGRVSLSE 

GTAI JaKFADAPDHlbcLASQAQS YIAHNFKQLSRMGS 1 REE TIADLTLAQIiLAVljRLbsijD IB S ERT 
VCHVAVQWIiEAAAKERGPSAAEVFKCTTO I VKKYCLDVI EGALQMRYG 

DLLYK&LVPVPNSSSS^^ 

S PLT SFAHTKTVT S S AVCV"S PDHDI YliAAQPRKDL WYKPAQNSW 

I Y I LGGRDP I TGVKLKBVECYSVQRNQWALVAPVPHS FYS FELI WQNYL YAWSKRMLCYDP SHN • 
MWLNCASLKRSDFQEA 

DAR5E SSTEWDLDGFSELDSESGS S 3 SFSDDEVWVQVAPORNAQDQQGSL 
>BAIP3 

GHNAPRKVTAVIYARKGSVLQSIEKI SS SVDATTVTSQQC^RDQEPKIHJ^EMASTSDKGAQGKND 

IOODSQGRSNKALiHLKSDAEFKKIFGLTKDLRVGLTRI PDHLTSGEGFDSFS SLVKSGTYKETEFMV • 

KEGERKQQNFDKKRKAKOTKKMDH I KKRKTENAYNAI INGEANVTGSQJjIi SSI LP TSDVS QKNT LT . 

SHSKTRQEKRTEmYYTHEK^ 

KQVLREKEAALEEMRKKMHQK * • 

>BARD1 

IAGFESLTCSFFWSRGLIASRSPRSLSSEGG^^ 
WAHSRAALDIttjEKLLRCSRCTNILM 

QLDSMIQLC S KLRNLLHDNEP SDLKEDKPRJCS LFNDAGNKKHS I KMWF SPRSKKVRYWS KAS VQT 

QPAIK20>ASAQQDSYEFVSPSPPiUDVSE ' 

KEESKQKIjV8FCSQPSVISSPQINGEIDL^^ 

VTPEKVCKlSIYIiTSKKSLPLEl^GK^GHHNRLS 

S PPSCKRKVGGTSGSKTVTCPMNSLVPHOVBHLjLH 

>CA150 

QQFI^GPLKILWPCCLFSQAPTTQDQTPSSAVSVATPTVSVSTPAPTATPVQTVPQPHPQTLPPA 

VPHSVPQPTTAIPAFPPVMVPP^ 

AVSEWTEYKTADGKTYYYHNOT^ 

EP IKEI KSEPKEEEMTEEEKAA.QKAKPVATAP I PGTPWC\A7WTGDERVFFYNPTTRIjSMWDRPDDL 
IGRADVDKI I QBP PHKKG^ELKKLRHPTPTMLS I QKWQFSMS AI KEEQELMEEINEDEPVKAKKR ' 
K 
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>CGI-125 

qrgyfkdkafvnylk^lywkd 

>CGl-74 




>CLH-17 

SAD S AIMNPASKVI ALKAGKTLQ I FNI EMKS KMKAHTIvrrDDVT FWKWI SLNTVALVTDNAV1£HWSM * 
EGESQPVK3yiFDRBSSLAGCQI IKTYRTDAKQKWLIiLXGI SAQQi^WGAMQLYSVDRKVSQP IEGHA 
ASFAQFKMEGNAEESTLFCFAVRGQAGGKLHI I SVGTPPTOTQPFPKKAVDVFFPPEAQM3FPVAM 
QISEKHDWFLITKXGYIHLYDLET 
>CLK1 

DAWVLEHXiNTTDPKTS TFRCVQMLEWFEHHGHI C I VFELLGIjS TYDF I KENGFLPFRLDHI RKMAYQ 
ICKSVOTLHSNKLTHTOLK^ 

TLVS TRHYRAPE VI LALGWS QPCDVWS I GC I L I E Y YLGFTVFPTHD S KEHLAMQMERI LGPLPKHMI 
QKTRJCRKYFHHDRIiDWDEHS S AGRYVSRRCKPIiKEFML SQDVEHERLFDL I QKMLEYDPAKRI TLR 
EALKSPFFDIiIjKKSI 
>DRP~1 ' 

KDKTFTLIPEGVNGIEERMTWm^ 

I WDPDKLKTITAKSHKSAVEYNI FEGMECHGS PLWI S QGKIVFEDGNINVNKGMGRFI PRK21FPE 

HLYQRVraRNKVFGLQGVSRGMYDG 

GAQIDDNNPRRTGHRIVAPPGGRSNITSLG 

>EFXA ' * 

MHKEALSEALPGDIWGFN SA.GYAP 
VLDCHTAHIACKFAELKEKIDRRSGK^ 
VRDMRQTVAVGVI KAVDKKAAGAGKVTK5 AQKAQKAK 
>EFlG(bait) - . 

AAGTLYTY PENWRAFKAL I AAQYSGAQVRVL S APPHFHFGQTKRTPEFIjRKFP AGKVPAFEGDDGF 

C^TFESl^IAYYVSNEELRGSTPE ■ 

EVRRI LGLl^AYLK?TRTFLVGERVTLiADI TWCTIJjWLYKQVIjEPSFRQAFPimTRWFLTCXNQPQ 

FRAVLGEVKLCEKMAQFDAK^AETQPKKDTPRKEKGS^ 

CEQALAAEPKAKDPFAHLPKSTFVLDEFKRKYSN^ 

LTQTF^<^IT@^QRLD&^ 

TWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK 
>EFlG(prey) 

AAGTLiYTYPEKnATRAFKAIilAAQYSGA 

C VFE S NAI A YYVSNEELRG S TP E AAAQ WQ WVS FAD S D I VP PAS T WVF P TLG I MHHNKQATENAKE 
EVRRI LGLLDAYLKTRTFLW FRQAFPNTNRWFLTCXNQPQ 
FRAVLGEVKLCEH^QFDAKKFAETQ 
CEQAliAAEPKAKDPFAHLPKSTFVIiPEFKRK^ 

LTQTFMS CNL I TGMFQRLDKXiRKNAFAS V ILFGTlflNS S S I SGVWFRGQELAFPLSPDWQVDYESY 

TWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK 

>FEZ1 

GNCSDTEIHEKEEEBFNEKSEITOSGINEEPLLTADQVIEEIEEMMQMSPDPEEEEEVLEEEDGGET 
SSQADSVI^QEMQALTQTP]^^ 

EKEVKNS F I TVLI EVQl^KQKEQREIMtCKRRKEKGLSLQS SRI EKGNQMPLKRF SMEGI SNIkQSG I 
RQTFGSSGTDKQYUjJTVI P YEKKAS PP S VEDLQMLTNI LFAMKEDNEKVPTLLTDYI LKVLCPT 
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>G45IP1 • 

MASSGGELGSLFDHHVQRAVCT 

ALYC^IEQY1M£EYPAEDFTEVYL^ 

KKLQMRKAYWKTTE^^ 

CELPLCYF S S KCMC S SGGPVDRAPDS S KDGRNHHKTM6HYNHND SLRKTQINSiiKNSVACPQAQKA 

Mtd-#j^^ * ; * " 7* 

>G45IP2 

RTCMPY IFSLSfcEALKCFRI RI^K^SDSHGVETiraiLPDTSLGGP 
AEEAALWRDLVRKVLASYLET^ 

GCFCAGCSEQ JGFSFVRPKLCAFSGLYYCDICHQDDASVI PARI IH1TWDLTKRPICRQALKFLTQI 
RAQPLIl^QMV^SLYEHraRM^ 
FSVAftLQQIADGVYEGFLKALIEFASQHVY^ 
TVFHQSCQAVVKKGCPRCARRRKYQEQKTIFA 
">G45IP3 

PNRGPLSPPlTOI^PSHVISLPLHNAPHtRPTNQHT^ 

NPIYYI^SFFLRRIFLSLLPLQPSPYPFIRRAL^ 

i>GADD45G 

GAGAEPGLECGWS WGAKGVCRWPGLGS PPRP PAG SRSLRTOLRElMQGAGKALHEIiljIiSAQRQ GCLT 
AGVYE SAKVLNVDPDl^VTFC VIIjAAGEEDHGDI ALQI HFTL I QAFCCENDI D I VRVGDVQRLAAI VG 
AGEEAGAPGDLHC I LI SNPNEDAWKD PALEKLSLF CEE SRS VNDWVP S I TLPE 
>GZ?l 

PQim>RSRQKCMSQSLiDLSEIAKAAKKKLQAi 

WERSAVPFLPVNPEYSATRNQGRQKXJ^FNAREFATLIIDILSEAK^ 

RSQSDIDDQHDYDSVASDEDTDQEPI^ 

AKVQQLMKVNSSLSDELRRLQREIH^ 

RQAFSlVlYEPGSAIiKPFGGPPGDELTTRLQPFHS^ 

SSPLLSCSQEGSRHTSKIiSRSGSGADSDYENT^ 

DPGLPSTEDVILKTEQVTKOTQEL^^ 

LRLLNASAYRLQS ECRKTVP PE PGAPVDFQliLTQQVI Q CAYD IAKAAKQLVT I TTREKJCQ 
>hADA3 • 

KDVDAIiLKKSEAQHEQPEDGCPFGAliTQRLLQAIiVEENI I SPMEDSPI PDMSGKBSGADGAS TSPR 

NQNKPFSVPHtfKSLESRIKEEL^ 

^RLAKEEVSRQELRQRVRM^^ 

>HB01 

DAERQEALGI VRR I GTDTEZ^TEPAGATVPAAAAAARI GTVGPQPPAMPRRKRNAGS S SDGTEDSD 
FSTDLEHTDSSESDGTSRRSARVTRSSA^^ 

PVTPKKYPLRQTRS S GSETEQWDF SDRETK3STTADHDE S P PRTPTGNAP S SESDIDI SSPWSHDE 

SIAKDMSLKDSGSDLSHRPKRRRFHESYNFN^ 

DBCKTVRAQSRDKQIEERMLSHRQDDNNRHATRHQ^ 

MEHRQTYGMTREPLLENLTSEYDL^ 

WYHS P YPbjdJ YARLGRLYMCE F CLKYMKSQTILRRHMAKC2VWKHP PGDE I YRKGS I SVFEVDGKKNK 
I YCQNLCLIjAiCLFLDHKTLYYDVEPFLF YVMTEADNTGCHIj I G YF S KEKNSFIjNYNVS C ILTMPQY 
I^QGYGKMjIDFS YLLS KVEEKVGS PERPLSDIjGLI SYRS YWKEVLLRYLHNFQGKEI S IKE I SQE 
TAVNPVD IVS TLQALQMLKYWKGKHLVIJ^ PKGT 
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>HD1.7 

MATLEKDMK3^SLKSPQQQQQQQQQQQCK»QQQQQQQQQP 
PQPPPPPPPPPPGPAV^SPLHRPKKELSATKKDRV^ 

LFLLC SDDAE S DVR>TVMECLNKVI KALMD SNL PRXjQLE LYKE I KXESIGAPR SLI^AALWRF AELAHL 
VRPQKmPYLVlSrLLPC^ 

LTAVGGI GQLTAAKBESGGRS RS GS I VELI AGGGS S CS PVLSRKQKGKVIjLGEEEALEDD SE SRSD 

VSSSALTASVKDEI SGEIAAS SGVS^ PGSAGHDI ITEQPRSQHTAGGLSGSGQL 

>HDdl.O 

MATLEKLMKAFESLKSFQQQQQQQQ 
PQPPPPPPPPPPGPAVAEEPIjHRPiaffi^ 

LFL1I4C SDQAESDVRMVADECLNKVT KALMD SNIiPRIjQLEIjYKE IKKItf GAPRS LRAALWRFAE LAHL 
VRPQKCRP YLVNLLPCLtTRTS KRPEE S VQETLAAAVPKIMAS FGNFAMDNE I KYLLKAF I AMLKS S 
SPTIRRTAAGSAVSICQHSRRTQYFYSWLLfcTVI^ 
>HDdl.3 

PRLQLEXjYKE I KKNGAPRSLRAALWRPAELAH^^ 

AAVPKIMAS FGNFANDNE I KVLLKAF I ANLK5 S SPTI RRTAAGSAVS I CQHSRR^TQYFYST^LNVti 
LGLLVPVEDEHSTLLI LGVXjLTLRYLVPLLQQQVKDT SLKGS FGFVTRKEMBVS P SAEQLVQVYELT 
LHHTQHQDHi!TWT^ 

GSSCS PVLSRKQKGKVLIiGEEEALEDDSESRSDVS SSALTASVKDEI SGELAASSGVSTPGSAGHD 

IITEQPRSQHT&GGLSGSGQL 

>HDexQ20 

MATLEICMMKAFESLKSFQQQQQQQQQQ^ 
QPPPPPPPPPPGPAVAEEPLHRJ? 

>HDexQ51 . . ' 

JYiATLEKLMKAFESL^ 

QQPPPPPPPPPPPQLPQPPPQAQPIjLPQPQPPPPPPPPPPGPAVLRSRCTDREST 
>HIP1 

ADTIX3GHRDRFMEQFTKLKDLFYRS 5NLQYFKRLI QI PQLPENP.PNFIiRASALSEHI S PVW'I PAE 
AS SPD SEPVLEKDDX^MDASQQJ^FDNKFDDI FGSSF S SDPFls^SQNGVNICDEKDHLIERLYRE 
. I SGLKAQLENMKTO^ 
AQRSLSEIERXAQAmQRYSKLKE^ 
SLERISDQGQRKTQEQLEVLESLKQE^^ 

SGAJUIREEELSAIjRKELQDTQLKL^TEE^^ • 

>HiPii 

VDLVTACD I RYCAQDJ^FQVKETOVGLAADVGTIjORLPKV'I GNQSLVNELAFTARKMMADE ALGSG 

LVSRVFPDKEVMLD^ 

VKS VQATTENKELKTVTF 9 KL 

>HIP13 * • . 

PCCSEDTIPSQVSDYDYFSVSGDQEAIX3QEFDKSSTI^ 

GPAMVTPGVAT I RRTPS TK3? SVRRGTI GAGP I P I KTPVI PVKTPTVPDLPGVLPAF PDGPEERGEH 
SPESPSVGEGPQGVTSMPSSIWSGQASVNPPIjPGPKPS I PEEHRQAI PESEAEDQEREPPSATVSP 
GQIPESDPADLSPRDTPQGEDMLm^ 
>ftlP15 ■ 

IHMAPPYPNIJSfMIET^ 

TAMARTKFH^HjKMLIjI^ SENP AVAKKLF S AKALS I FVGliFlTI EETNDNI Q I VI KMFQNI SNI I KS G 
KMSLIDDDP'SLEPLI SAFREFEELAKQLQAQIDNQWDPEVGQQS , 
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>HIP16 

DEEERNHRQMIKEAPAGDDVira 

IKAPEQPPRKDKNLPNVT INBKRWIHAAAHQVRVLPYPFTHHWQFSRTI QTP I GS TWNTQRAFQKL 

TTPKWTKPGHI INPI K&EDVGYRS S SR^DLS VI QRNPKRITTRHKKQLKKCSVX) 

>HIP2 

MANIAVQR IKRSFKEVI ^EE^^ 

Q YKQITOEMFKQTARLWAHVYA^^ 
SNX 

>HIP5(bait) 

FLKS I LKKBSKYEHGYLKAXjI INQSFKFGNQKAA&I RDS lELTKEKGAE I PKTIKKJjRWFDETSNI 
EISnSEAENSHSLKNKTGTTQQHSQQFHIQSGAGSNI I SVSTCAVNSADTKKSREDS I SENVTTLGGSG 
ADHMPI^CFIPSGYNFAKHAWPAS^ 

TNRKGAVI QPQSASKVNi FTQAQGKLI IPCPPPQSTSNIRSGKNI QVSQCQPVTPENPQNI I THNS 
. FNSIOlVLPTEHSLNQraQESSS 
TLYCTQRSPyCSESYPSVTLRTAEEESVPLW^ 
KRQNPGSVGQKYSEQINNFGQSVLLSSSEPKQTra^ 

LTVLNSKQ 1 QKSNLPL3SFKTQQFN I CTL6AE EQKI LB S IjNDUjTERLHYI QES I CKNP S I KNTLQI I P 
LiLEKREDRT S S CRDKR 
>HIP5 (prey) • 

FLKSILKCTSKYEHGYLKALII^ 
ElSDJlAENSHSLKKrKTGTTQQHSQQFHIQSGAGSNII 

ADHMPUsrCF I PSGYOTAKHAWPASKKEESKI PVHDD SKTKQGKPQRGRAKI IRKPGSAKVQSGFI C 
TlTOfcGAVIQPQSASKOT 

FNSKHVLPTEHSLNQWNQES S SPLSNACSDLVTVI PSLP S YCS SECQTFAKINHSNGTQAVARQDA 

TLYCTQRSPVCEESYPSWLRTAEEES 

KRQNPGSVGQKTSEQII^GQSVLLSSSEPKpTTRGTSY 

LTVLNS'KQ I QKSNLPLNKTQQFNT CTL SAEEQKI LE SLNDLNERLHYIQE S I CKNP S I KNTLQ IIP 

LLEKREDRTSSCRDKR • 

>HMP 

qeqvxieslaicsledalrqtasvtlq^ 
egalkerrkavdeaadallkakee 

kvqaaq s eakvvs qyiffilvvqarddfkrelds i ttpevlpgv^gms vsdladkls tddlnsli aeah 

RRI DQIiSTRELAEQKATEKQHI TLALE I QAEQDRKI EBVKDAM 

ENEMRTQLRRQAAAHTDHLRDVLRV^ 

AYARLRGIEQAVQSHAVAEEEARKAHQ ' 
FTQALTAAIPPESLTRGVYSEETLRARFYAVQKIiARRVAMIDETRN 

LKPPPELCPED INTFKLL SYASYC I EHGDLELAAKF WQLKGESRRVAQDWLKKUlMTIiETKQI VE 
1 LTAYAS AVGI GTTQ VQPE 
>BP28 • 

PPADSDIjKYDTPVIjVSROTEKRSPKARIiLKVSP 

LNAI LPPREWVEDTQLWI QQVS S TP STRMDWHLQEQLDLKLQQRQARETG I CPVRRELYS QCFDE 
LI REVtT I NCAERGLLLIiRVRDE I RMT I AAYQTLYES SVAFGMRKALQAEQGKS DMERKI AELBTEK 
RDLERQVNEQKAKCEATEKRE S ERRQVEEKKHttTEE I QFLKRTNQQLKAQIjEG 1 1 APKK 
>HSPC232 

RimADGCIYGVSRRARVVAYRJ^EMWSEGRYEY^ 
EYRDMRDGFRRKSFYS SHYARERSPYKRD^^ 
FHQSQHRl^RPVQSLKTSRDTSPSSGSAV^^ 

LPEISEYEAGSTAPt.FTDQPEEPESNTTHGlELFEDSOLTTRSKAIASKTKSiEQVYRQD 
WKMLIEKDPSLEKSIQFAIiRQNLHEIGERCVEELKOT 
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>HYPA ■ 

GRRRS SLSPTl^PGTGAERGGLMMGHPGMHYAP 

QMPGMBffiSVMPGMMMSHMSQA^ 

Q S TWEKPDDIiKTP AEQLLSKCPWKEYKSDS GKP YYYNSQTKESRWAKPKEIiEDLBGYQOT I VAGS L 
ITKSOTHAMI K^BSSKQEECTTTSTAPVPTTBIPT™ 

SALAKLSEKKQAFNAYKVQAKKKEK^ 
>HZFH 

HARFAJ^CLAESHQHLSKB^ 

LQMSBRS I LSRLA5 KGTEPHPTPAYPPGP YATPPGYGAAF S AAFVGALAAAGANYS QMPAGS FI TA 

ATNGPPVLVK^KBMVX^ 

>IKAP ' 

LKEGSPLEDLAIjLEALSEW 

IWTLTYQQNSATPVLGPNSTANSII^ 

>IMPD2 

DFLILPGYIDFTADQVDLTSALTKKITLCT 

QANKTOKVKKYEQGF I TDPWL S P KDRVRDVFEAKARHGFCG I P ITDTGRMGSRLVG IIS SRD IDF 
LKEEEHDCFLEE IMTKREDLVVAP AGI TI*KEA1?TE I LQRS KKGKLF IVNEDDELVAI I ARTDLKKHTR 
DYPIiASKDAKKQLLCGAAIGTHEDDKYRLDLlA I FQINMI KYIKDKYPNXi 

QVI GGtT\n/TAAQAKNL IDAGVDAIjRVGMGSGS I CITQEVIjACGRPQATAVYKVSKyARRFGVPVIA 
DGGIQNVGHIAKAI^GABTVMM 

YFSEADKI KVAQGVSGAVQDKGS IHKFVPYIiI AGI QH9 CQD I GAJCSIjTQVRAMMYSGELKFE KRT S 
SAQVEGGVHSLHSYEKRLF . 

>KPNA2 ' 
' AWALOTI ASGTSEQTK&VVDGGAI PAF I S LLAS PHAH I SEQAVWALGNI AGDGS VFRDLVI KYGAV 

DPLXjALIiAVPDMS S IjACGYLRNXjTWTL Sl^CRJSI33srPAP P IDAVEQ I LP TLVRLLHHDDPBVLADTC 
' WAISYLTDGPNERIGMWKTGVVPQ 

VFP SXJjTNPKTNI qkeatwtmsni TAGRQDQ I qqvv^glvpflVsvlskadfktqkeawavtny 

TSGGIVEQIVYIjVHCGIIBPLM^ 

DKIEAliQNHENESVYKASLSL^ 

>KPNB1 

LAAVGIiVGDLCRALQSNI IPFCDEVMQLLLE^GNENVHRSVKPQ GGEFKKYLE 
VVLNTIjQQASQAQyDKSDYDl^^ 

FIDHIAGDEDHTDGWAGRAGLIGDLCTAFGKDVLKLVEARPMIH 

TKELRKLKNQA 

>KU70 

KTRTFNTSTGGLLLPSDTK*RSQIYGSRQ 

SLFVYPEES LVI GS STLF SALL I KCLEKE VAAIjCRYTPRRNI P P YFVALVPQEEELDDQKI QVTPP 
GFQLVFL PFADDKRKMPFTEKIMATPEQVGKMKAI VE KIJIFT YRSDS FENPVLQQHFRNLEAXiALD, 
LMEPEQAVDLTLPKVEAMJTKRLGSLVDE^ 

ELKTHI S KGTLGKF TVPiyn^KEACRAYGLK5GLK3CQELLK?UjTKHFQD 
^UCTBl ■ 

VDAVAVDAAAVSAKASKVHELiNEKI GKLLAKAEQLGAEGNVDESQKI LMEVEKVRAiCKKEAEEEYR 
NSMPAS S FQQQKLRVCEVC SAYLGHODlffiRRlADHFGGKIjHLGF I Q IREKI^QLRKTVAEKQEKRN 
QDI&RKREEREREERLSRRSGSRTRDRRRSRSRD^ 

RSHSRGHRRASRDRS AKYKFSRERASREE SVJE S GRS ERGPPDWRLE S SNGKMASRRSEEIQSA.GE I 
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' >MAGEHl 

ASFPRTAySFEPIAGDMPR^^ 

PEEDP STP EEASTTPEEAS STAQAQKPS VPRSNFQGTKKSLLMS I LAL I F IMGNS AKEALVWKVLG 
KLGMQPGRQHS I FGDPKKI VTEEFVl^GYIi JY KPVPRS 5 PVE YEFFWGPRAHVES SKLKVMHFVAR . 
VRJNRC SKDWPO^rroTOSBDDAS LNSGARGYSAP 
>MAPllc3 

PTQAF FLLVNQHSMVSVS TP I AD I YEQEKX)BDGFLYMVYASQETFGF 
>mHAPl 

PKEQ VQSGAGDGTGSGDF AAGTPTTQPAVGPAP EP SAE PKPAPAQGTGSGQKSGSRTKTGS FCRSM 
1 1 GDSBAPWTRYVFQGPYGPRATGLGTGKAEG I WKTPAAYI GRRPGVS GPERAAF IRELQE ALCPN 
PPPTKKITEDDVKVMLYLLEEKERDIj^ 
QVHLRDD^QLYSDS 

LETLQQKI,RLLfiEENDHLREE^ 

qqkei tqlqaei tklqqrcqs ygaqteklqqmlasekgihseslrags ymqdygsrprdrqedgks 

brqrssmpagsvthygysvplt^ 

amppppax 

>irip53 ' 

VTETPGP VAPAPATPWPL S S FVP S QKTYQGNYG FHLGFLQ S GTAKS VMCTYS PPIiMKLPCQLAKTC 

PVQLWVSATPPAGSRVRAMAIYKKSQHMT^^ 

DRQTFRHSVVVPYEPPEAGSEYXTIHYKYM^ 

VCAC'PGRDRRTOESOTRKi^LCPELPPGSAKRAL 

MFRELNPALELKDAHATEESGDSRAHS 

>NAG4 

RDRVENEAE KDLQCHAPVRLDLP PEKPLXS SLAKQEEVEQT PLQEALNQLMRQXiQRKDPS AFFS FP 

VTDFIAFGYSMIIKHPMDFSTMKEKIK^ 

HSGMKILSQERIQSLKQSIDFMADL^ 

KENKXKDKDMLEDKFKSNWI^ 

DPIVGEPGYCPVRLGMTTGRLQSGWTLQGF 

DDSDLIYSTYGEDSDLPSDFSXHEFIATCQDYPYVMADSLLDVLTKGGHSRTLQEM^ 
TRTLDTAKEivrEITEVEPPGRLDSSTQDRLIAIiKAV 

LQEAQNERL S TRPPPNMI CLLGPS YREMfflLAEQVXlSnsrLKELAQQVTPGD I VSTYGVRKAMGI SIPS 

PVMENNFVDLTEDTEEPKKTDVAECGPGGS 

>NEFL. 

LiSPLSSLSGLPPPPRAGEPPAATMSSFSYEPYYSTSYl^ 
SAPVSSSLSVRRSYSSSSGSLMPSLE2^LSQVAAISND^ 
ELEQQNKVLEAELiiVLRQKHSEPSRFRALYEQEIRDLRLAAEDAT^ 

ARYEEEVLS REDAEGRLMEARKGADEAALARAELE KRIDS IjMDE IS FLKKVHEE J3 I AELQAQX QYA 
Q I S VEMQVTKPDLS AALKDI RAQ YEKLAAiOSn^QNAEEWFKS RFTVLTE S AAKNTDAVRAAKDEVSE 
SRRiLKAKTLEI EAC^GMNEAIjEKQLQELEDKQNADI SAMQDTINKLEOTLRTTKBEMARYLKEYQ 

DLLNVKMALDIE sqvfgrsaygglqtss ylmstrsf 

PSYYTSHVQEEQIEVEETIEAAKAEEAKDEPPSEGEAJEEEEK^^ 

KEEEEGGEGEEGEETKEAEEEEKKVEGAGEEQAAKKKD 

>p53 . . 

MEEPQSDPSVEPPLSQETFSDLWKIiLPEJNKVIiS 
• PEAAPP VAPAPAAPT PAAPAP APS WPLS S S VP SQKTYQG S YGFRLGFLHS GTAKS VTCTYS PALNK 
MFCQLWCTCPVQLW^ 
GNLRVEYLDD'RNTFRHSVVyPYE^ 

LGRNS FEVRVCACPGRDRR^ EEENLRKKGE PHHfiLPPGSTKRAIaPl^TS SS PQPKKKPLDGEYFTL 
Q IRGREJ RFEMFRELNEALELKDAQAGKE PGGSRAHS SHLKSKKGQS T S RHKKLMFKTEGPD SD 
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>PFN2 

APRRPRCSAKGSK^GWQSYVDNlMCTQCCQEUVA^ 

KDRBGFFTNGLTLGAKKC S VI RDS LYVDGDCTMD I RTKS QGGEPTYl>ri^VGRAGRVL\^ VMGKEGV 

HGGGLNKKAYSMMCYLRDSGF 

>PIASy (bait) 

PPF^DEI&KPTELVPQMreKL^ 

PQEDQYPPNI AVKVNHS YC S VPGYYP SNKPGVEPKRPCRP INLTHLMYLS S ATNRI TVTWGNYGKS 
YSVAL1HJVRQLTS SEDLQRLKlIGVra 

VPCPJ^TCMLOCFDAVPYiQMNEKKPTWMCPVCDKPAPyDQLI IDGLLSKILSECEDADEI EYLV 
DGSWCPIRAEKERSCSPQGAILVLGPS^GLLPAPSVNGSGALGSTGGGGPVGSMENGkPGADVVr' 

dltldsssssedeeeeeeeeepedeegprpkrrcpfqkglvpac' 
>PIASy(prey) ' . '' ' " 

LVEAKlWVMSFRVSDLQMLLGWGRSKSGfcK^ 

SEPAPQPHRPl^PLTMHSTYDRAGAVPRTPrAGPNIDYPVLYGKYLNGLGRLPAKTLKPBVRiVKL 
PFF^LDEL&KPTELVPQ^KLQESFCIFALTPRQVELIRN^ 

PQEDQYPPNIAVKVNHSYCSVPGYYPSNKPGVEPKRPCRPINLTHIJ^LSSATTOiaTOGlTYGKS 

YSVALYLVRQLTSSELI^RLKTIGVKHPELCKALVKEKLRLDPDSBIATTGVRVSLICPLV^ 

VPCRAETCAHLQCFDAVPYLQMNEKKPTOMCPVCDKPAPYDQLIIDGLLSKILSECEnADE 

IXSSWCPrRAEKERSCSPQGAILVLGPSDANGLLPAPSVNGSGALGSTGGGGPVGSMENGKPGADW 

DLTLDSSSSSBDEEEEEEEEEDEDBEGPRPKRRCPPQKGLVPAC • 

>PLIP 

GEIIEGCRIjPVLRRNQBREDEWPLAEILSVKDISGRJK^ 
PKKEAIjTPTKNQLPGSRPGSPEREVKRKVEWSPAT^ 

KSNCLGTDEDSQDSSDGI P SAPHMTGSLVSDRSHDD J VTRMKNIEC JELGRHRLKP'WYFSPYPQEL 
TIlPVLYLCEPCLKYGRSLKCLQPJa.TKCDLRHPPGNEiYRKGTISFFEITORKNKS 

kcfichktlyyotdpflfyvmteto^^ 

EFSYELSKVEGKTGTPEKPLSDLGLLSYRSYWSQTILEILMGLKSESGERPQITINEISEITSIKK 
EDVISTLQYLl^Il^KGQYILTLSEDIVDGHEW . 
>PTN 

LSQRQDQVPRLPVQKSRQESPRAEENPKMREGKKETSESSVQKAGRAAAAQAGAAASRVPGLSGSM 
IAPC^RLSAREIWSNSKMQAQQYQQQRRKFAAA^ 

gf^qwsvcottsgdcglgtregtrtgabckqtmktqrckipcnwkkqfgaeckyqfqawge 

ALKTRTGS LKRALHNABCQKTVT J S KP CGKLTKPKPQAESKKKKKEGKKQEKMLD 
>PTPK . 

SNYINAAIJroSYRQPAAFIVTQYPLPN^^ 

rygpiqvecmscsmdcdvtiitoifricnltrpqegyiwqqfqylgwa^hrevpgskrsf^ 

EKWQEECEEGEGRTIIHCLNGGGRSGMFC&rGlVTO^ 
• YRFCYDVALEYLESS 
>SETBD1 

KASTSGI^-KDEGpIKQAKKEDTDDRNKMSWTESSPJSrrGYNPSPVKPEGLRRP 

^AQSNPDDVITLSSSTESEGESGTSRKPTAGQTSATAVDSDDIQTISSGSEGDDFEDKKHMTGP 

MKRQVAVKSTRGFALKSTOGIAIKSTI^SVDKGESAPVRKIWRQFYDGEESCYIIDAKLEGM.GR 

YLNHSCSPNLFVQlW'FVDTHDLRFPWVAFFASKRIRAGTELTTOYira 
RGRLL • ' • 
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.;>SH3GL3 

VAGLKKQFHKASQLFSEKI SGJ^GTKLDDEFLlMERKIDVtNKVVAEI LS KTTEYLQPNPAYRAKIj 
GMLNTVSKIRGQVKTT^^ 

vkSItidflql:^ - 

MFNFLEfltr^QV^^ SAAS5VPRREYKPRFVKRSSSE . 

FFPINYVEVIVPLPQ 
>S\M6*2 

RPRAQLRRESGGAESVTRPLR^ 

iS^TPLSKLJ^YCERQG^ 

LAGHSF 

>SuMO-3 J . 

PSSTi^SFFCRSWeeiiCAilLVRTWYIjF 

GSWQFKI K^TPLSKLM^YCERQGLSMRQIRFRFDGQP I3^TOTPAQLEMBDEDTIDVPQQ(JTG 
GVY 

SSPVKRQRMESALDQLKQFTTWADTGDFHAI 

GRKLGGSQEDQ I KNAI DKLFVLFGAE I LKKI PGRVSTETTOARLS FDKDJ^VARARMjI EL YKEA.G I 
SKDRILIIOLSSTWEGIQA^ 

TDKKSYE PLED PGVKSVTKI Yl^YYKKF S YKTT VMGAS FRNTGE I KAIAGCDFLTi S PKLLGELLQD 
' :&AiqjVPVLSAKZ^QA 
AENGK - 
>TCPG 

QTDI EITREEDFTRILQl^EEY J 

Dl^lARACC^IVSRPEELftEDW 

SEVEKttLQDAMQVO^ 

LIQNCC^STimLTSLRJ^ 

L^IDDIV0GHKi^CT)DQSRQGGAPi3A.GQE; 

>VIM 

■ SFRQRRSRAPT^Tim^^ . 
S SRS YVTTSTRlYSLGSAIaRPSTSRSLY 

INTE^KOTRflffiKVELQEiJ^ • 

RRQVDOLTND^ 

VESLQESi&^KKLH^ 

W^SKFADLSEAAl^ 

AOTQDTiGMiQDEIQ]^^^ SLPLPNFSSL 




QVQS LTGETTOALKG WE SLERQ I GRLQDEI Q^^EMAR^^ 
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>ZHX1 

EQTIM)LTFDGSFVKEENTAEQAESTEVS SSGISIS KTPIMKMMKNKyEK^ 

ENE I KPDREEI VENPS 9 S ASE SOTS TS I Vl^IHPS TASTVVTPAAVIjPGX^QV J T&VSAQQ!NSNIj I 
PKVLIPVNS IPTYKAALDI^ 

WTPEBVEE^RKQFNGTVHTVPQTI TVI PTHI STGSNGLPS I LQTCQ I VGQPGLVLTQVAGTNTLP 

QL^^^S^^Qt^EDSEI IRlM'±TGLTKGEI KKl^iDra^^ 

DSSDETTESPTVGTAQPKQSWNPFPDFTPQKFKEK^ 

LTI^EIDAWFTEKKKSKALKEEK^ 

HMLKSAFVRTQWP SPEEOTKLAKESGLARTD I VS WFGDTRYAWKNGNLKWYYYYQSANSS SMNGLS 
SIjBJ^GRGRPKGRGRGI^RGRPRGSKRINOTTORGP SLI KFKTGTAI LKDYYXjKHKFIjNEQDLDELV . 
NKSHMGYEQVREWFAERQRRSELGI^ 

spp 

>ZNF33B 

CYBCGKTFCLKSDLTIHQRTHTGEKPPACPECGKPFSHKSTLSQHYRTHTGEKPYECHECGKIPYN 

KSYLTKKKRTHTGEKPYEaraCGKTFdQ 

HTQEKPYKC^CGKSFCVKSGLIL^^ 

ECGKI F YRKSDLAKHQRSHTGEKPYECNT CRXTF SQKSNLI VHQRTHI GEKP YE 
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GCCGAATCAGTAGTTGGGGCTGC^TCGCTTCTG^ 
CETCGGGCTGGftGtfUlGCCCCTO 

GAGACTTCCAGGGCAGCGGTGCCTCCTGGGACAGTGGTGCCTACCG^^ 
. GTGACCGAGQGTCCTGGGGTAGCAGCACCTAC 
ACCGAGGCAl^ 

GCAACC CCTGGGGCTCACACTGG&GCTATACCGAAAGCCA(^ 

GGTGGAGGCAAGGGTGTAACCAGGTCCCGGAATGGGGGCAAGGGCA^ 

GAAGTAGACGAACTGGGGATGGGCTTCCGTCCTGGAGATOT 

GCTAATGGCGGACAGGCTTT C CTGGCAQAGGTCCCTGATTCTGAGGAAGGGGAGTCCGGGTGGACT 

GACACAGAGTC&GATTCAGACTCl^GCC^ 

GCC^TG(^GAAGCGCCCCTTTCCTXATGAAATTGATG 

GTCCTTGCCTTGCTTCAGAAATCTGATG^^ 

AACAATGCCAATTATTCATGC^^ 

AAC^TGATCAACAAAACTGAT CCAC^CATTAAGGAAAAAGC CTTAATGGCCATGAATAACCTGAGT 

GAGAATTATGAAAATCAGGGCCGGCTTCAG^^ 

TCTAACCTGAACTCAGCAGTTCAAGTAG 

GACTACCAACACCTGCTTGTCAATTC^ 

AAAATCAAC^TTGAGATTTTGAAAATCCTTTCGAA^^ 

CTTCTCAGTACCCAAGTGCCAGCATCATTTAGTTCCCTCTATAATTG 

CTT^TTAA£GCCCTT^^ 

AGAGAATTCAATAAAGGTTCCCTTTTTT^^ 

AGAGCCTXAGCAAATCACCATGACCT 

>APP1 

GAGGAAGAGGAGGAATCCTTCCCACAGCC^^ 

gagksaagaggaaacggtcccacccccaagctc^ 
accccxwsgcccacagacggtgtggat^ 

gggttcctgagggccaagatggacctggaggagcgtaggatc ■ 
gaa£gggccatggcagacaaccagtccaagaac^ cctgaatgag . 

c^cttccagtcc^ttctgcagactctgga^^ 

acccacgcc^cccgcgtcatcgcccttatcaacgaccagcgccgggctgccttggagggcttcctg 
gcagccctgcaggcagaTccgcctcaggcggagcgtgtccxgttggccctgcggcgctagctgcgt 

GCGGAGCAGAAGGAACAGAGGC^C^CGCTGCGCCAC^ CGCCGTGGATC C CGAG 

AAGGCAC^GCAGATGCGCTTCC^GGTGC^^ 

AGCCTGGGCCTGCTTGA.CCAGAACCCCCACCTGGCTCAGGAGCXGCGGCCCCAAATCGAGGAA 

CTCCACTCTGAACACCTGGGTCCCA^ 

AAGGGTGGGCTGCACTCCTCCAGATTCC^GGAT 

GAACAAGATGC TGCATC CCCTGAGAAAGAGAAGATGAAC CCGCTGGAACAGTATGAG CGAAAGGTG 

AATGCGTCTGTTCCAGGGGTTTCCCTTTCC^CTCA^ 

CTGGGACAGGGGTGtCCCGTGAGGCTGTGTCGGGTCTGC 
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CGGCCC^GGACGAAC&T^ 
CCCT<^<^CTX<^GAG'<^ 

GCAGAGATTGAC^TCCTGGCTGC^GAGTACATCTCCACGGTGAAGACGCTGTCT ■ 

GTGGAGCGCCTGCAGAAGAT c cagaacgc c tac^gcaagtgcaaggaat AC agtgacgacaaagtg 

CGCTTTGtaAGCAOATCTGAA^^ 

ttaaaaaaa^gccggggtcagaaagaaaaaagagggtcccggggccgaggcaggaggac^ 
gaagacac^ccaaagaaaaagaagcacaaa^ 

(^CCCCTCimTGTGCTGGACATGC 

TGCGTtKACCTTACCACGAAACCCAAAGGAAAATGG 
>BAIP2 

AGCCAGCAGGCCAGCGTGACC^XGCACGAXGXGGACGCCG^ 
XGCTACACGGGTCGTGTGTCXCTCAGTGAGGCCAAXGT 

ctac&gctggaaxatgtgcgggaagcctgtgcctc 

TG<^CCGCC^TCCTC^^ 
TACATAGCTCACAACTTCAAGClAGCTCy^ 

ctaaccctGggcc^gctgctggctgtcctacgcctggaxagtctggacaxagagagtgagcggact 

GTATGCCAXGTAGCXGTGCAGTGGCTGGAGG CTGCTGCCAAAGAGCGGGGTCCCAGTGCTGCAGAA 

gtcttcaajgtgcgtgcgctggatgcacttcactgaagaagat 

accaagcccatcgxgaagaagxactgcct 

gacctgttgtacaagtctc'tggtgccagtgccaaacagcagc^gc 

cttgtatctgc^g<^gaaaaxccaccccagagactgggtaxg 

tttggacatccxagagatcccxttctcxgcxatgaccctxactc^ 

XCCCCTXTGACCAGCTTTGCfT^^ 

CATGAC&TCTAXCTAGCTGCTCAG 

TGGCAGCAACTTGGAGATCGCTXGCTGXGXC^^ 

ATCTACATTTTGGGGGGACGAGACCCTATTACTGGAGTTAAGTTGAAGGAAGTGGAATG 

GTTCAGAGAAACCAGTGGGCATTGGTO 

GTGGTTCAGAACTATCTTTATGCTGTCAACAGTAA 

ATGTGGCTGAACTGTGbTTCTCTTi^ 

atctattgtatctgtgac^^ 

axxagtaatattccxtxggaxtcagagacccacm 

cttgtcatcacttctacaaccccacaa^ 

agggaagatcagtggaxxaatataggxaccatgxtaggccxxtxgcagtttgacxcxggcxxtatx 
xgccratgtgcxcgtgxtxatccttgcxgcctxgaacctggtcagagxxxtaxtactgaggaagax 

GATGCACGGAGTGAGTCTAGTACTGAAXGGGACTXAGATGGAXTCAGXGAGCXGGACXCXGAGX^ 

ggaastxcaagtxcttxxtcagaxgaxgaagxctgggtggaagxagca 
gatcagcagggttcxxtg 
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>RAIP3 

GGACLkCAATGCCCCCAGAAAAGXAACAGCCGT CATTTATGCTAGAAAAGGAAGTGXCCX CCAGAGC 
AXAGAGAA&AXAAGTTCCT CTGTXGATGCAkCAA^ ' 

c^gaaccaaagatccataatgagatggcatc^^ 

aAga^gatxctcaaggaagaagtaataaggc^tt^ 

atatttggccttact^ 

G^GGrrT^ 

aaggaaggagagagaaa&cagcagaatttx^ 
gatcag&iaa^ 

accggrxcccaagtcctaagcagtattttaccaacttc^ 

agtcacaggaaaaccagacaagaaaagagaactgagatggaat^ 

aa&tocactttgaattcaaatgc^^ 

aagcaggtgctgagagagaaagaagcagctcttgaagaaatgcgt^^ 

>BARD1 

ttggccc^tttcgagtcgctgacctgcagcttccctgtggtttcccgaggcctccttgcttcccgg 

tctccgaggagcctttcatccgaaggcgggacgatgccggataatcggcagccgaggaaccggcag 

ccgaggatccgctccgggaacgagcctcgttccgcgtccgccatggaaccggatggtcgcggtgcc 

tgggcccacagtcgcgccgcgctcg^ccgcctggagaagctgctgcgctgctcgcgttgtactaac 

attctgagagagcctgtgtgtttaggaggatgtgagcacatcttctgtag'taattgtgtaagtgac 

tgcattggaactggatgtccagtgtgttacaccccggcctggatacaa 

caactggaGagcatgaxtcaactttgtagtaa^ 

gatttgaaagaagataaacctaggaaaagttxgtttaatgatggaggaaa 

aaaatgtggtttagccctcgaagtaagaaag 

cagcctgcaataaaaaaagatgc^gtgctcagcaagactcat^ 

cctgcac^tgtttctgagag^gctaaaaagg 

actttagctgaaatcaaccaaaaatggaatttagaggcagaaaaag 

aaagaggaatctaagca&aagctggtatccttgxgtagcc^ 

ataaatggtgaaatagacttactagcaagt'ggctccttgacagaatctgaatg 

actgaagtctcxxxaccattggctgagcaaaxagagtctccagacactaaga 

gt<^ctcctgagaaggtctgc2aa^^ 

ggaaaacgtggccaxc&caaxagactttccagtcccaxxtctaa^^ 
agcaccagxggagaxtxxgttaagcaaacggxgccctc&gaaaa^ 
tcaccaccttc^tgcaaacgtaaagttggtggt^^ 
aattcattagtctttcacc1aggtacacgaccttctacat 

>CA150 ■ 

CAACAATTCATTCCTGG^CCCTGAAGATACTTGTTTGGCCCTGCTGTCXATTTAGCCAAGCACCC 
ACAACACAAGAX CAGAC CCCAAGTTCTGCTGTTTCAGTTGCCACGC CTACAGTTAGTGTTTCAACT 
CCTGCTCCTACAGCCACACCTGTGCAAACCGTTCCCC^GCCGCACCCTC^GACGTTACCTCCTCCT 
GTTCCTC^TTCAGTACCXCAGCGAACAACAGCAATACCXGCTTTTCCACCAGTAATGGTACCTCCG 
TTTCGTGTTCCCCTTCCTGGCATGCCAATTCCACTTCCAGGTGTATTGCCAGGAATGGCCCCTCCT 
ATCGTAC J CATGATACATCCCCAGGTTGCTATTGCAGGTTC AG CTGCXACCTTAGCTGGAGCAACA 
. GCAGTTTCTGAAXGGACTGAAXATAAAACAGCAGATOT 

XTAGAAX caacctgggaaaaacc c CAAGAACT AAAGGAAAAAGAAAAGTT AGAAGAGAAGATTAAA 

GAGCCAArTAAAGAACCCTCXGAAGAGCCTCTGCCAATGGAGACG 

GAGGCTATAAAGGAC^TAAAGGAGGAGCCCAAAGAAGAGGAGATGACTGA^ 

GAGAAGGCAAAGCCAGXTGCT AC TGCTGCTATTC CTGGTACTCC^TGGTGTGTCGTTTGGACTGGX 
GATGAGCGGGTCTTCTTTTATAATCCCACCACTCGTCTTTCTATGTGGGACCGIACCTGATGAXCXG 
AXTGGCAGGGCAGATGTTG^CAAAATTATXCAGGAGCCCCCTCATAAAAAAGGAAXG(^GGAATTG 
AAGAAACTAAGGCACCCA^CTCCGACAATGCTGTCGATGCAAAAGTGGCAAT 

attaaagaggaacaagaattaatggaagaaat^ 

AAG 
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>CGI-125 

ttcgacgcgtccgcgcggaacttcgcccgcgtc^ctgggcttxtgctctgtc^ggctggtcsgcgtt 
ttggtgtcttcgtttottatggccgctgcto 

CGGTTTCAGTTGGAGTTGGAATTTGTGC^ 
CAAAGAGGTTACTTCAAAGACAAAGCTTO 

CACTTCCSA^G^GCT^ 

TGGCMCACTATTCCCGGAAGCGGATG^ 

AACACATCGGGAAAA 

>CGI-74 

GTAGAGWUUX»03^ 

AGTTTTCAGCAGGAGAAACTT CGAGTCTGTGAAGTCTGCT CTGCCTATTTAGGACTTCATGATAAT 
GACAGAeGACTGGCTGATC^^ 

CTTGAAGAATTAAAGAGAGT C GTAG C T GAGAAGCAGGAGAAAAGAAACCAGGAACGGCTGAAACGA 

AGAGAAGA<lAGAG&GAGAGAAGAAAGGGAGAAGCTGAGGAGGTCC^^ 

AAAAGG 

>CLH-17 

ATGGCCCAGATTCTGCCAATTCGTTTTCAG^^ 
GCA7VACATTGGCTTCAGTACCCTGACTAXGGAGTCTGACAAATTCAT 

GGAGAGCAGGCCCAGGTGGTAATCATTGATATGAATGACCCAAGTAATCCAAOT gaatt . 

TCAGCAGACAGCGCCATCATGAATCCAGCTAGCAAAGTAATTG 

CAGATTTTTAACATTGAAATGAAAAGTAAAATGAAGGCTCATACCATGACTGATGATGTCACCTTT 

TGGAAATGGATCTCTTTGAATACGGOT^ 

GAAGGAGAGTCTCAGCCAGTGAAAATGT 

AATTACCGTA^GATGCSAAAC^^ 

GTGGTGG^GCTATG^GCTAT^^ 

GCTAGCTTTGCAGA.GTTTAAGATGGAAGGAAATGCAGAAG 

CGGGGCCAAGCTGGAGGG^GTTAC^TATT^^ 

T/TTCCSUU^GAAGGCAGTGGATGTCTTCT^ 

C^GATCAGTGAAAAGCATGATGTGGTGTTCT^ 

CTTGAGACT ■ 

>CLK1 

GACGCGTGGGTTCTGGAACATCTGAAT 
TTGGAATGG1TXGAGCATCATGK3TC^(^ 

GA.CTTCATTAAAGAAAAT GGTTT XCTACC^TTTCGACTGGATCATATCAGAAAGATGGCATATCAG 
ATATGCAAGTC TGTGAATTTTTTGCACAGTAATAAGTTGACTCACAC^GAC TTAAAGCCTGAAAAC 
ATCTTATTTOTGC^GTCTGACTACACAGAGGOT 
TTAATAAATCCAGATATTAAAGTTGTAGACTO 

ACATTGGTATCTACAAGACATTATAGAGC^CCTGAAGTTATTTTAGCC C T AGGGTGGTCCCAACCA 
TGTGATGTCTGGAGCATAGGATGCATTCTT^ 

GkCGATAGTAkGOAGCATTra CAAAACATATGATA 

CAQAAAACCAGGAAACGTAAATATTTTCAGGA.CGATCGAT 

GGCAGATATGTTTCAAGACGCTGTAT^ 

GAGCGTCTCTTTGACCTCATTCAGAAAATC 

GAAGCCd?TAAAGCATCCTTTCTTTGACCTTCTGAAGAAAAGTATA 
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>DRP-1 

AAGGACAAC TTTACCCTGAT C CC C(^GGGTGTGAACQGGATAGAGQAGCGGATQ^C CGTCGTCTGG 

GACAAGGCGGTGGCTACTGGCAAAATGGATGAGAACCAGT^^ 

GCCAAGATCTTTAACCTGTAGCCAAGGAAAGC^ 

ATCTGGGACCCCGAC^GTTGAAGACCATAAC^^ 

ATCTTCGAGGGTATGGAGTCC^C^ 

CACGTGTACCAGCGCGTCAAAM^ 

TATGACGGXCCTGTGTACG^GGTACCAGCTACACCCAAA 
.TCTTCGCCTTCTAAACACCAGCCCCCAC^ 

GGTGOTCAGATAGATGACAACAATCCC^ 
■ CGCTCGAACATCACCAGCCTCGGT 

>EF1A 

ATGt^CCATGAAGCTTTC&GTC 
TCTGTCAAGGATGTTCGTCGTGGCAACGlT^ 

gctg^cttcactgctcaggtgattatc^ 
gtattggattgccacacggctcac&ttgc^^ 

cgttctggtaaaaagctggaagatggccctaaattcttgaagtctggxgatgctgcgattgttgat 

atggttcctggcaagcccatctgtgttgagagct 

gtxcgtgatatgagacagacagttgcggtgggtgtcatc^ 

gctggcaaggtgaccaagtctgcccagaaagctcagaaggctaaa " 

>EPlG;(bait) 

gcggctgggaccctgtacacgtatcctgaaaac^ 
tac^cggggctcaggtccgcgtgctc^^ 

acccctgaatttctccgcaaatttcctgccggcaaggtcccaggatttgagggtgatgatggattc 
tgtgtgtttgagagcaacgcc^^ 

gaggcagc^gcccaggtggtgc^gtgggtgagct^ 

acctgc^tgttcccc^ccttgggc^tcat^ 

gaagtc^ggcgaattctogggctgctggatgcto 

cgagtgac^ttggcxgacatgacagttgtc 

ccttctttccgccaggccto^ 

ttccgggctgtcttgggcgaagtgaaactgtgtgagaagatggcccag.tttgatgctaaaaagttt 

GCAG AGAC C C2AACCTAAAAAGGAC ACACGACGGAAAGAGAAGGGTT CACGGGAAGAGAAGCAGAAG 
C C C CAGGCTGAGCGGAAGGAGGAGAAAAAGGCGGCT GCC C CT G ctc ctgaggaggagat GGATGAA 

tgtgagcaggcgctggctgctgagccc^ggccaaggaccccttcgct 
tttgtgttggatgaatttaagcgcaagtactc 

ttctg(^agcactttgataaggacggctggtccctgtggtactgagagtatcgcttccctgaagaa 
ctcactc^gag cttcatgagctgcaatctca.tcactggaatgt'tc cagcgactggacaagctgagg 
aagaatgccttcgccagtgtcatcctttttgg^^ 
gtcttccgaggccaggagcttgcctttccgctgagtc 
acatggcggaaactggatcctggcagcgaggagaccc^^^ 

GAGGGGGCCTTCC^GCATGTGGGCAAAGCCTTCAATC^lGGGCAAGATCTTCAAG 
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>EFlGCprey) 

GCGGCTGGGACCCTGTACACGTATCCTGAAAACTC 

TACAGCGCK3GCTCAGGTCC 

ACCCCTGAATTTCTCCGCAAkTTTCCTGCC^^ 

TGTGTGTTTGAGAGCS^CGCC&TTGCCTA^^ 

GAGGCAGC&GCCCAGCT^ 

A^CTd^TGT^^ 

GAAGTGAGGCGAAXTCTGGGGCTGCTGGATGCTTACTTG^GACGAGGACTTTTCTGGTGGQCGAA 

CGAGTGACATTGGCTGACATCAGAGTTGTCTGCACCC 

CCTTCTTTCCGCCkGGCCTTTCCC2^T^^ 

TTCCGGGC TGTCTTGGGCGAAGTGAAACTGTGTGAGAAGATGGCCCAGTTTGATG CTAA&AAGTTT 
GCAGAGACCCAAC C TAAAAAGGACACACCAC GGAAAGAGAAGGGTTCACGGGAAGAGAAGCAGAAG 

ccccaggctgagcggaagqas^ 

tgtgagcaggc6ctggctgctgagcccaaggccaaggacccct^ 

cttgtgttggatgaaittaagcgcaast^^ 

ttctgggagc^ctttgataaggacggctggtccctgtggtactcagagtatcgcttccctgaagaa 
ctcactcagaccttcatgagctgc^ 

aagaatgccttcgccagtgtcatcctttttgg 
gtcttccgaggcc^ggagcttgccot 

acatggcggaaactggatcctgggagcgaggagacccagacgctgg^ 
gagggggccttcc^gcatgtgggcaaagccttca^^ 

>FEZ1 . 

ggc^ctggtctgacactgagatccatgagaaagaagaggaagagttcaatgagaagagtgaaaat 

gattccggtatcaacgaggagcctctgctcacagc^ 

atgc^g^ctccccagaccctgaggaagt^^ 

tcctcccaggcagactcggtcctcctgc^ggai^^ 

tggtcctatgaagggctgaggc&catgtct^ 

ggtgccatccgtgacttctcggagcw^tggtg^ 

gagaaggaagtgaagaactcctttatcacggxgct^ 

cgagaactgatgaaaaagaggcggaaagagaaagggcrga^ 

GGAAACCAGATGCCTCTCAAGCGCTTCAGCAT 
CGCGAGACCTTTGGCTCCTCAGGAACTGACAA^ 

A^GCCTCTCCTCCCTCAGTGGAAGACCTGC^GATGCTGACAAACATTCTCTTTO 
GATAATGAGAAGGTGCCTAGTTTGCTAACGGACTACATTTTAAAAGTGGTCTGCCCTACC 
>G45IP1 * 

atggcgtcgagcggcggggagctagggagtttatttgat^ 
acacgggcgaaatatcgac^gggacgacggcctcgtgctgt^ 

tctc^gtacttattaa'TacaaggagttcctgctgtgggagtcatgaaggAattagttgagcgattc 
gctttatatggtgcaattga&£a^^ 

GTTTATCTTATTAAATTTATGAACTTA^ 

agtttcttcggtggattgcttgatgtgtgctatg^ 

aaaaaactacaaatgcggaaggcatatgtagtaaaaactactgaaaataaagac 

aagzuigaaattggttacagagcataa^ 

tctggattttgtaaagctgctttgaacacttctgcagggaactcaaatccttatgttccgtattcc 

tgtgaattgcgtttatgttatttctc 

gc^ccagactcctctaaggatggtagaaa^ 

tctttgcggaaaacacagataaactctttgaaaa 

attacgtcttcagaggcagttgacagatttatg^ 

agaagagaagatgatcgtaaacttggaacttttcttgaaacaaacgcaactggtaatga 
attggacgtctgttagc^gacatctctaaagtggatatgcacgatgactcattgaatacaacggcg 
amttaattcggcataaacttaaaga^^ 
gaagatgtaoatacaagxcatccattaal^acaaagaagaagaata 
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>G45IP2 ' 

AGGACCTGFATGCCCTATATAl'CT 

AATGAGAAGATGCtGaGTGACAGCCACGGCGTC^GACCA 

CTTGGGGGCCCATCCTTCTTCAAAATCATCACOT 

GCCGAGGAAGCCGCCCTGTGGAGGGATCTGGTCCGCA^GTCCTG^^ 

CGG3^j3^GCTTCCTO 

GGCTGCTTCTGCGCAGGGTGCTCCCGGCAGATCGGCTTC 

TTCTCTGGCCTCTAITACTGT^^ 

ATCCACAACTGGGACCTC^CCAAGCGCCCGATCTGC^ 

cgggcccagcccctcatcaacctgcag^tggtgaacgcgtct 

CACCTC^TTGGGAGGAGACGGGAGCAGCTGAAGCTCCT 

ggcgccctgaaggagct 

ttcagtgttgctgacctccaacagatcqcagacg(3ggtgtatg^ 

gaatttgcctcccagc^tgtctacc^ct^ 

tgccagcagc^cgacmcatcttcccctt^^ 

accgt£ttcgA£Cagagctgccaggctgtggtc 

aagtaccaggaacagaacattttcgcc 

>G45IP3. 

cctaac^ggggccctctc^gccctcctaatgacctcgggcctagcc^tgtgatttcacttccactc 

cataacgctcctcatactaggcct^^ 

aacacgagaaagcacataccaaot 

aatcctatttattacctc^ga^^ 

cctagcccctaccccccaattaggagggc&ct^ 

agakgtcccactcctaaacacatccgtattactcgc^^ 

>GADD4.5G . 
GGTGCAGGCC^TGAGCCGGGATTGGAGTGTGGTTGGAGTT 

CCOTGGCTGGGGTCTCCGCCGGGCCCTCCGGCCGGCTCCCGCTCACTGCGCTGGCTCCTCCGCAGG 
ATGCA0GGTGGCGGGAAAGCGCTdCATGA^ 

GGCGGQGTCTACG^GTCAGCCAAAGTCTTGAACGTGGACCCCGACi^TGTGACCTTCTGTGTGCTG 
GCTGCGGG.TgAGGAGGACGAGGGCGA^ 

TGCTGCGAGAACGAGATGGACATAGTGCGCGTGGGCGATGTGCAGCGGCTGGCGGCTATCGTGGGC 
GCCGGCGAGGAGGCGGGTGCGGCGGGCGACCTGCACTGCATC 

GCCTGG^GGATCCCGCCTTGGAGAAGCTCAGCCTGTXTTGCGAGGAGAGCCGCAGCGrTAACGAG 
TGGGTGCCCAGCATCACCCTCCCCGAG 
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>GIT1 

CCAGAGAXGGCTGAC&GATCTCG^^ 

AAAGCTGCTAAGAAGAAGCTGC^GGCGCTCAGG^ 

TATG&CG&GGTGGATCGA&GAGAAAATGATGCA^ 

GTGACAC^GCGCAGTGCCGTGCCCTTCCT 

GGGCC^ CAAAAGCTGGC CCGCTTT^ 

CGGAGCCAGAGTGACCTGGACG^CCAACACGkC^ 

CAGGAGCCCCTGCGCAGCACCGGCGCCACTCGGAGC^CCG^^ 

TTGTCTGACG&kSGCTGTGAC^C 

GCAAAGGTGC^GCAGCTCATGAAGGTCAACAGT^^ 

GAGATCCACAAGCTGCAGGCGGAGAACG^ 

CraCTCCCCAGTGAACGGG^ 

CGCC&GGCCTTTTCCATGTATGAACCTGGCTCTGCCCT 
GAGCTCACTACGCGGCTGCAGCCTTTCCACAGC^ 

C^CGTCCCTGCTGGCCTTTAGCGdATCCGGAAAGGGGTGTCTGCCTCAGCTGTGGCCTTGACTCCC 
TCCTCCCCGCTGCTGTCCTGCTCCCAGGAGGG^GCCGCC&^ ' 
AGTGGAGCCGACAGTGACTATGAGAA^CGCAAAGTO^ 
AGGTTTCTAGAGCTGGGCAAAGAGGAAGACTTCCACCCAGA 

GATCCTGGGCTT CC CAGCACAGAGG AT GTCA.TCTTGAAGACAGAGGAGGTCAC CAAGAA.C ATTCAG 
GAACTGTTGCGGGCAGCCC^GGAGTTC^GCATGACAGCTTCGTGCQCT^ 
TTGGCTGTGAC CGAGATGGCCTCCCTCTTCC CA^GAGGCCAGCfC^ 
CTGCGGCTGCTGAACGCCAGCGCCTACCGGCTGCAGAGTC 

CCCGGGGCCCC^GTGGACTTCC^GCTGCXGACTCAGCAGGTGATCCAGTGCGCCTATGACATCGCC 

AAGGCTGCCAAGCAGCTGGTCACCATCACCACCCGAGAGAAGAAGCAG 

>hADA3 

AAAGATGTGGATGC CCTGCTGAAGAAGTCTGAGGC CbAGCATGAACA CGGAAGATGGATGCCCC 

TTTGGTGGCCTGACGCAGCGCCTCCTGCAGGCCCTGGTGGAGGAAAATATTATTTCCCCTATGGAG 

GATTCTCCTATTCCTGACATGTCTO^ 

AATCAGAACAAGCCCTTCAGTGTGCCGC^ 

ATTGCCCAGGGCCTTTTGGAGT.CTGAGGACCGCCCCGCAGAGGACTCCGAGGATGAGGTCCTTGCT 
GAGCTTCGCAAACGGraGGCTGAGCTGAAGGCACTTAG^^ 
CTGCTGAGXSCTGGCAAAGGAGGAGGTGAGCCGGC^ 
AACGAGGTCATGGACGCCTTTCGCAAGATC&TGGCTC^ 

AAAGAAAAGGACCAGGCCTGGAAGACTCTGAAGGAGCGTGAGAGCATCCTGA^GCTG^ 
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>HB01 

. GACGCTGAGAGGCAGGAGGC&CTAGGGATCGTCCGCAGGATTGGG^^ 

GAGCCGGCCGGAGCCACCGTTCCTG^TGCTGCCGGCGCTGCCCGaA.TCGGAACCGTCGGGCCGCAG 
CCGCCH3GCAATGCCGCGAAGGAAGAGGAATGCAGGC&GTAGTT^ 

TTrTOTAC^GATCTCGAGCaCACAG^GAGTTC^AMGTGATGGCACATCCraACGATCTGCTCGA 
GTC&CCCGCTC^TCAGCC aflGCTAAQ CCAGAGTTC 

CCAGTGACACCGA2UiAAATACCCTCTTCGGCAGACTOGTTCAl'CTGGTTC^GaAACTGAGCAAGTG 
GTTGATTTTTCAGATAGAGAAACXAAAAATACAGCTGATCATGATGAGTCACCGCCTCGAACTC^ 
ACTGG&AATGCGCCTTCTTCTGAGTCTGACATAGACATCTCCAGCCCCAATGTATCTCACGA.TGAG 

AGCATTGCCAAGGACATGTCCCTGAAGGACTCAGGCAGTGATCTCTCTCATCGCCCCAAGCGCCGT 
CGCTTCCATGAAAGCTACAACTTCAATATGAAGTGTC^ ' 
CTTACAGGAAAAGATGAGAGACATTTCTCCATCTCAGGATGCG 
GACGA&TGCAAGGTGAGAGCACAGAGCCGK^^ 

Cy^GATGACAAGAACAGGCATGCAACCAGGCACCAGrGCACCAACGGAGAGGCAGciTCGATATAAG 
GAAAAAGTGGCTGAACTCAGGAAGAAAAGAAATTCTGGACTGAGCAAAGAACAGAAAGAGAAATAT 
ATGGAACACAGACAGACCTATGGGAACACACGGGAACCTCTTTTAGAAAACCTGACAAGCGAGTAT 

gacttggatcttttccgaagagcacaagcgcgggcttcagaggatttggagaagttaaggctgcaa 

GGGCAAATC^CAGAGGGAAGCAACATGATTAAAACAATTGCTTTTGGCCGCTATGAGCTTGATACC 

tggtatcattctccatatcctgaagaatatgcacggctgggacgtctctatatgtgtgaattctgt 

TTAAAATATATGAAGAGCCAAACGATACTCCGCCGKKA^^ 

CCTGGTGATGAGATATATCGCAAAGGTTCAATCTCTGTGTTTGAAGTGGATGGCAAGAAAAACAAG 
ATCTACTGCC^AACCTGTGCCTGTTGGCCAAACTTTTTCTGGACCACAAGACATTATATTATGAT 
GTGGAGGCCTTGCTGTTCTATGTTATGACAGAGGCGGACAACACTGGCTGTCACCTGATTGGATAT 

ttttctaaggaaa&gaattcattcctcaa 

atgagacagggctatggc^gatgcttattgatitcagttatttgctttccaaagtcgaagaaaaa 

GTTGGCTCCCCAGAACGTCC^CTCTCAGATCTGGGGCTTATAAGCTATCGCAGTTACTGGA^GAA 

GTACTTCTGCGCTAGCTQCATAATTTTGAAGGCAA^GAGATTTCTATCAAAGAAATCAGT(^GGAG 

ACGGCTGTGA&TCCTGTCGACATTGTCAGGACTCTGCAAGCCCT 

GGAA?VACACCTAGTTTTARAGAGAC1AGGACCTGATTGATGAGTGGATAGCCAAA 

TCCmACTCC^TAA^CCATGGATCCCAGGTGCTTAAAATGGACCCCTCCCAAGGGCACT 
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>HD1 .7 

ATGGCGACCCTGGRAAAOCTGATG^GGCCTTC^TCCCTC^GTCCTTCCAGCAGCAGC^GC^G 
CAGCAQCAGCAGCAG<^GCAGCAGC^G^ 

CCGCCGCCGCCGCCGCCTCCTCAGCTTCCTCAGCCGCCGCCGCAGGCACAGCCGCTGCiTGCCTCAG 

CCGCAGCOGCCCCCGCCGCCGCCCCCGCCGCC^CCCGGCrc^ 

CGACCAA&GAAAGAACTTTC&GC1A 

AAAGTTATCAAAGCTTTGR.TGGATTCTAATCTTCGAAGGTTAC71GCTC 

AAAAAGAATGGTGCCCCTCGGAGTTTGCGTGCTGCCCTGTGGAGGTTTGCTGAGCTGGCTCACCTG 
GTTCGGCCTGAGA^TGC^GGCCTTACCTGGTGAACCTTCTGCCGTGCCTGACTCGAACAAGC^ 

agacccgaagaatcagtccaggagaccttggctgcagctgttcccaaaattatggcttcttttggc 

AATTTTGCAAATGACAATGiAAATTAAGGTT^^ 



CAATATTTCTATAGTTGGCTACTAAATGTGGTCTTAGGCTTACTCGTTCCTGTC^AGGATGAAC^C 
TCCACTCTGCTGATTCTTGGCGTGCTGCTCACCCTGAGGTATTTGGTGCCCTTGCTGCAGCAGCRQ 
GTCAAGGACACAAGCCTGAAAGGCAGCTTCGGAGTGACAAGGAAAGAAATGGAAGTCTCTCCTTCT 
GCAGAGCAGCTTGTCCAGGTTTATGAACTGACGTTACATCATACA(^GCACCAAGACCA<^TGTT 
GTGACCGGAGCCCTGGAGCTGITGCAGCAG^TCTTC^GAACGCCTCC^CCCGAGCTTCTGCAAACC 
CTGAGCGCAGTCGGGGGCATTGGGCAGCTCACCGCTGCTAAGGAGGAGTCTGGTGGCCGAAGCCGT 
AGTGGGAGTATXGTGGAACTTATAGCTGGAGGGGGTTCCTCATGCAGCCCTGTCCTTTCaAGAAAA 
CAAAAAGGCAA^GTGCTCTTAGGAGJ^GAAGAAGGCTTGGAGGATGACTCTGAAT.CGAGA 

GTCAGCA.GCTCTGCCTTAACAGCCTCAGTGAAGGATGAGATCAGTGGAGAGCTGQCTGCTTCTTCA 
GGGGTTTCC^CTCC&GGGTC^GCAGGTC&TGA^ 

GCAGGCGGACTCAGTGGATCTGGCCAGCTG 

>HDdl.O . , ' 

ATGGCGACCCTGGAAA2U3CTGATGAAGGCCTTCGAGTCCCTCAAGTCCTTCCAGC2AGCAGCAGCAG 
CAQC^GCAGCAGCAGCAGCAGCAGCAGGA_GC^ 

-CCGCCGCCGCCGCCGCCTCCTCkGGTTCCTCAGCCGCC^ 

CTOCAGCCGCCCCCGCCGCCGCCCCCGCCGCCACCCGGCCCGGCTGTGGCTGAGGAGCCGCTGCAC 
CGACCAAAGAAAGAACTOTC&GCTACCAAGAAAGACCGTGT^ 

AACATAGTGGCACAGTCTGTC&GAAATTCTCCAGAAT^ 

CTTTTTCTGCTGTGCMT^^ 

AAAGTTATCAAAGCTTTGATGGATTCT^ 

AAAAAGAATGGTGCCCCTCGGAGTTTGCGTGeTGCCCTGTGGAGGTTTGCTGAGCTGGCTCACCTG 
GTTCGGCCTCAGAA&TGCAGGCCTTACCTGGTGAACCTTGTGCCGTGCCTGACTCGAACAAGCAAG 
AGACCCGAAGAATCAGTCCIAGGAGACCTTGGCl'GCAGCTGTTCCCAAAATTATGGCTTCTTOTGGC 
AATTTTGCAAATGACAATGAAATTAAGGTTTTGTTAAAGGCCTTCATAGGGAACCTGAAGTCAAGC 
■TCCCCCACCAITCGGCGGACAGCGGCTGGATCAGCAGTGAGCATCTGCCAGCACTCAAGAAGGACA 
C^TATTTCTATAGTTGGCTACTAAATGTGCTCTTAGGCTTACTCGTTCCTGTCGAGGATGAACAG 
TCCAGTCTGCTGATTCTTGGCGTGCTGCTCACCCTGTGTAGA 
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>KDctX-3 

CCftAGGTTACAC^TCGAGGTCTAT^ 

gccctgtggaggtttgctg&gctggct(^cc . 

aaqcttctocx^tgcct^ 

gcagctgttcccaaaattal'ggcttct^ 

gcagtgagcatctgccagc&ctcaagaagga^ 

tsaggcttactcgttcctc^ 

ctgaggtatttggtgcccttgctgcagcagca 

gt<^caaggaaagaaatggaagt£tct ' 

ttac^tcatacacagcacc^u3a^ 

ttcagaacgcctccacccgagcttctgcaaaccctg 

gctgctaaggaggastctggtggccgaagc^ 

tottcctcatgcagccctgtc^ 

GCCTTQC^GCaTO&CTC^ 

GATGAGATCAGTGGAGAGCTGGOTGCTTCTTCAG^ 

ATCATCAGAGA&TAGC 

>HDsxQ2'0 

ATGGCGACCCTGGAAAAGATGATGAAG^ 

CAGCAGCAGCAGCAGCAGCAGCAGCAGC^GCAGC^ 

CCGCCGC£GCCGCCTCCTCAGCTTCCTC^GCCGCCGCCG(^^ 

CAGCCGCCCCCGCCGCCGCCCGCGCCGCC^CCCGGCCCGGCTGTGGCTGAGGAGCCGCtGCACCGA 
CCG •. . 
>HDexQ51 

' ATGGCGACCCTGGAA^GCTGATGAAGGCCT^ 
CAGCAGCAGplGCAGG^GCA 

c^gcagcagcagcagcagc agcaggag cagcagcag c^gcagcagcagcagcagcagcag cag cag 

caacagccgccagcgcCgccgccgccg 

c^gccgctgctgcctcagccgc^gccgcccccgcro^ 

ctgaggagccgctgcaccgaccgtgagtcgac • „ 

>HIP1 . 

gctgacaccctgcaaggc^ 
taccgctccagcaacctggagtact 

• CCCAkCTTCCTGK^GAGCCTCAGC 
GCCTCATCCCGCGACAGCGAGCCAGTCCTAGAGAAGGAT^^ 
CAGAATTTATTTGACAACAAGTTTGATG 

TTCAACAGTCAAAATGGTGTGAACAAGGATGAGAAGGACCACTTAAOT 

AT CAGTG GATTGAAGG CACAGCTAGAA^ACATGAAGAC TGAGAGC CAG CGGGTTGT GCTGCAGCTG 

aagggccacgtcagcgagctggaagcagatct 

gacgactgtgaattcctgcgggcagaactggacgagctc^ggaggcagcgggaggagaccgagaag 

gctc^gcggagcctgtctgagatagaaaggaaagctcaagccaatgaacagcgatataggaagct^ 

aaggagaagtacagcgagctggttcagaacgacgctgacctgctgcggaagaatgcagaggtgacc 

a^caggtgtccatggcc^gacaagccc^ 

tcgttggagcgcatcagtgaccagggcc^gcggaa.gactcaagaaca 

ttgaagcaggaacttgccacaaggcaacgggagcttclaggttctgcaaggcagcctgg 

gcccagtc^gaagcaaactgggc^gccgagttcg^^ 

agtggcgcagctcatagggaggaggaattatctgctcttcggaaagaactgcaggacactcagctg 

a^ctggccagcacagaggaatctatgtgccagctt^ 

gggtccaggaaggctgcggagcaggtgatacaagacgcgtcgacgcggccg 
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>HIP11 

GtfGGACCTTGTCACCGCCT^^ 

GTGGACGTGGGTTTGGCTGCCGATGTAG^^ 

AGCCTGdTCAACGAQCTGGCCTTCACCGCCCGC^^ 

CTGGTCAGCCGGGTGTTCCCAGACAAAGAGGTCATGCTGG^TGCTGCCTTAGCGCTGGCGGCCGAG 

TCG^GGCCGA<^GCCTC^CT 

GTGAAGTCGGTCCAGGCCACGACTGAGAACAAGG^ 

>HIP13 

CCCTGCTGCTCTGAGGAGACCATCGCTT CCCAAGTTTCAGAT^TGATTATTT CTCTGTAAGTGGT 

GACCAGGAGGC^GATCAGCAGGAGTTCGACAAGT 

CAGTCCTAC.CGACGGATGTTCO^^ 

<^ACCTG£TATGGTC&CTC 

CGCCGGGGAACCATTGGAGCTGGTCCGATCCCCATCAA<^^ 

ACCGTCCCAGACCTCCCAGGGGTGTTGCCAGCCCCTCCAGATGGGCCAG^GAGCGGGGGGAGC^C ' 
AGCCCTGAGTCGCCM^ 

AGCGGCCAAGCTTCCGTTAACCCTCCACTTCCAGG^ 

CAC^CAATTCCAGAAAGTGAAGCTGAAGACCAGGAACG^ 

GGCC^GATTCCAGAGAGTGACCCTGCAQACCTGAGCCGAAGGGA 

CTGAACGCCATCCGAAGGGGCGT^ 

TTTTCT 

>HIP15 . 
ATTCACATGGCTCCACCTTATCGAAATCTi^^ 
GAAACCCTOGCACATAGTGTGGATTCCCTTGAC^ 
ACTATGACtATTGACTATC^^ 

AC^GCGAATGCGAGAACGAAGTTTCAeGTTCTGAAZ^^ 
GTGGK2AAAAAAACTATTCAGTGCCAAAGCTCTT-T 
ACAAATGATAATATTCAAATTGTT^ 
AA&ATGTCCTTAATTGATGATGAT^ 
• GAGTTAGCTAAGCAACTACAAGCCCAAAT^ 
>HIP16 . 

GATC1AAGAGGAGAGAAACCAT AGG CAGAT GATAAAGGAAGCTTTTGCTGGG<3ATGATGT CATCAGA 

GATTTCTTGAAAGAGAAGAGGGAAGCTGTG^ 

CCTGGCTGGGGCGAGTGGGGTGiGTGTGGGCCTAAAGCCC^ 

ATTAAAGCCCCTGAGGGTCCTCCAAGAAAAC^TAAGAAT^ 

CGCAACATCCACGCAGCTGCTCATC^GGTACG^ 
. TTCGAAAGGACGATCCAGACCCCCATAG^ 

ACTACTCCCAAGGTCGTCACCAAGCCAGGCCATATCAOT 

TACCGGTCTTCCTCAAGGTCGGACCTGTCTGT^ 

CACAAAAAACAGCTGAAGAAATGCTCTGTAGAT 
. >HIP2 

atggcgaaeatcgcggtgcagcgaatg^ 

agcaaaaatc^^ttaaagtagatcttgtagatgagaattttacagaatt 
ggacctccagagacaccatatgaaggaggaagatac 
. cc^tttaatccccctaaggtccggx ttatc^ct^^ 
ggggctatttgtttggatatcct& 
ttgtcattgc^gcactattggcagctgcag^ 
c^gtacaaacaaaatcccgaaatgttc^^^ 

^accagttxctagtccagaatacaccaaaaaaatagaaaacctatgtgctatcggctt^ 

aatg<^gtaatagt(^ccttgtctc 

agtaactg 
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>HIP5 (bait) 

TTTCTTAAAAGTATTTTAA^^ 

AATCAGAGCXXXAAGXTTGGAZ^TCAAAAAGCAGCAGCTAXC^^ 
GAAA^GGTGCAGAA^^ 

GAAAkCAAXGCXGAA^CAGTCATTGkCTGAAGAAT^ 

h ^"^ ^^ft ffi^lT?! ^ ft ^fy 1 ^^ ^ ??rdftft (1 A T A A TT A f^rCff T-TT(^^AG^^G^fiSSG^A^ArE=E£^?^ 

6eT"GAM£&^ 

GCAGACGATATGCCXXXGAACXGXXXTAXACCTTCAGGXTATAACTTTGCTJ^^ 
GCCTCAAAAAAAtSAAGAZU^ 

CAAAGAGGTAGAGCAAAAATAATTAGAAAACC^^ 
ACAAACAGAAAAGGCGCT^^ 

GAGGGAAAAtTAATTATACGTTGTCCTCCTCCTCAATCTACATCAAATATTAGAAGT 

AXAC^GXGTCTCAGTGTGAACGAGXAAC-TCCT^ 

TTTAATTCAAAAC^TGTGCTTCC^OAGAA 

CCACTCTCAAATGCTTGTTCTOACCTAGTCACTO 

GAGTGCO^CTTTCGCAA^AATJAA^ 

ACATOATATTGC^CC^^ 

GCTGAAGAAGAATCAGTTCCCTTATGGAAAAGAGGTCCTAATGTCCTGCATCAAAATAAGAGGG 
ACAGGGTCTACTGTTATGAGAAGAAAACGAATTGCTGAAACTAAGCGGAGAAATATTTTAGAGCAG 
AAAAGACAAAACCCTGGATCTGTAGGAC&GAAGTACAGTGAGCS^ 
GTCCTGCTAAGTTCAAGTGAGCCAAAACAA^ 

GAXAGXACXXCXGAGXXTTTGATGGCXGAAAACTTAGTGAAAGCATCAGXGCCG 

CXGACTGXCXXGAATAGCAAACAGATACMAAAT 

AACATCTGC&CACTGTC&GCTGAAGA^ 

CTACATTATATACAAGAATC CATTTGCAAAAACCCAT CCATCAAAAAXACTXT 
CTTCTGGAGAAGAGAGAAGATAC^CCAGCAGCT<iCAGAGACAAGAGA 
>HIP5 (prey) ■ 

XTTC TTAAAAGTATTTTAAAGAAAGAATCTAAATATGAACATGGTTAT CTTAAGGCATTAATTATA 

AATCAGAGCTTTAAGTTTGG&i^TCAAAAAGC^^ 

GAAAAAGGTGCAGAAATTCCAAAGACTATTAAAAAACTC^ 

GAAAACAAXGCXGAAAACAGTCATTCACTGAAGAATAAAAC^^ 
. CAATTCCACATTCAAAGTGGTGCTGGAAGGAACATAATTAGTGTTXCTACTTGTGCTGTAAA 

GCTGATACAAAGAAGTC GAGGGAGGATTCTAXCXCTGAAAAXGXXACGAC TTTAGGAGGAX CXGGA 

GCAGACCATATGCCTTTGAACTGTTTTATACCTTC^GGTTATAACTTTGCTAAACATGCCTGGCCA 
, GCCTCAAAAAAAGAAGAAAGTAAAATGCCTGTACATGATGA 
' CAAAGAGGTAGAGCAAAAATAATTAGAAAACCAGGATCTGCAA 

ACAAACAGAAAAGGCGCTGTCATTCAACCiA.CAGTCTGCAAGCZAAAGTG 

CAGGGAAAAXXAAXTATACCXTGXC CTCCTCCTCAAT GTACATCAAATATTAGAAGTGGTAAAAAT 

atacaagtgtctcagtgtcaaccagtaactcctgaaaatcctcaaaacattattacacataa 
xttaattcaaaacatgtgcttccaaqkgaac^c^^ 

ccactctcaaatgctxgtxcxgacctagxcacxgxgataccatcactgccaxcaxattgtxcxxca 
gagxgcwaaacxttcgcaaaaataaatcattcs^ 

acattaxaxtgcaccc^uuvgaagxccxcSxxxgtgaagaaagxxaxccgtcxgtgacxcxaagaacx 
gcxgaagaagaaxcagttcccttaxggaa2^gaggxccxaaxg 

acagggtctagtgxxaxgagaagaaaacgaattgctgaaactaagcggagaaaxatxxxagagcag 

aaaagacsu^aacccxggatctgtaggacagaagxacagtgag^^ 

gtcctgctaagttcaagxgagccaaaacaaactacaaggggtacot 

gaxagtacxxcxgagxttttgatggctgaaaact 

ctgacxgxcxtgaatagcaaacag^xacagaaaxca^ 

AACATCTGCACACTGXCAGCTGAAGAACAGAAGATCCXAGAGTCCCTTAATGAXGTGAATGAAAGA 

CTACATTATATAGAAGAAXCCATXXGCAAAAACCCATCCA^ 

CTTCTGGAGAAGAGAGAAGATAGAAGCAGCAGCXGCAGAGACAAGAGA 
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>HMP 

CAAfiAACAAGTTAAAATTGA 

gtcactctgcaggctattgcagctcagaat^ 

TTGAAAGCCGCGATGGAC^TTCTGAGATTGCA 

gagggtgcatigaaggaacgcagaaaggc^gt^^ 

eCTeM&T^CTGCTC 

AAtiGTCCAAGCAGCTCAGTCTGAGGCTAAGGTTGTATCTCAGTATG^TGAGCTC 

CGGGATGACTTTAAACC^GAGCTGGACAGT^^ 

AGTGTTTCAGACTTAGCTGACAAGCTCTCTACTG^^ 

GGTCGTATTGATCAGCTGAACAGAGAGCTGGC^^ 

TTAGCCTTGGAGAAACAAAAGCTGGAAGAAAAGCGGGCATTTGAC^ 

GAAGAT-CACAGAAGTGAAATAG&GGCTC , 
G&AAkTGAAATGAGAACCC^GCTTCGCC 

CTTAGGGTACAAGAACAGGAATTGAAGTCTGAATTTGAGCAGAACCTOT 
CAAGAATTACAATTTCGTCGTCTCAGTC^ 

GCCTATGCGAGACTGA.GAGGAATCGAACAGGCTGTTCAGAGCCAT 

AGAAAAGCCCACCAACTCTGGGTTTCAGTGGAGG^CA^ 

GJUUU^CCTACTATCCCGCTGGGTAGTGC 

ttcacccaagctttaacggcagctatccctccagagtccctgacccgtggggtgt^ 
acccttagagcccgtttctatgctgttc&aa^ 

• agaaatagcttgtaccagtacttcctctcctacctac^gtccctgctcctattcccacctcagcaa 
ctgaagccgcccccagagctctgccctgaggatataaacacatttaaattactgtcaxatgcttcc 
tattgc^ttgagcatggtgatctggagctagcagcaaagtttgtcaatcagctgaaggg^ 
agacgagtggcacslg^ctggctgaaggaagcccgaara 
atcctgacagp^tatgccagcgccgtaggaataggaacizactcaggtgc^ccagag 

>HP28 

GCGCCCGC^C^CTCTTTGCTCAAGTACGAC^^ 

agccccaaggctcggctactgaaagtcagccccc 
ccacccaagaccaagctcccctcaactccct 

" ttgaatgccatactaccgccaagggagtgggtggaagacacgcagctatggatccagcaggtgtcc 
agcacccctagcaccaggatggacgtggtgc^cctcc^ggagcagttagac^ 
cggcaggccagggaaftgaggcatctgccctgtccgcag^ 

. ttgatccgggaggtcaccatcaactgtggggagagggggctgctgctgctgggagtccgggacgag 
atccgcatgaccatcgctgcgtacc^gacgctgtacgagag(^gcgtggcgtttggcatgaggaag 
gcactgcaggctgagc^ggggaagtcagacatggagaggaaaatcgcagaattggagacggaaaag 
agagacctggagaggcaagtgaacc^ggagaagggaaaatgtgaagccactgag 

' ga^ggcggc^ggtggaggagaagaagcac^tgaggagato 
cagctgaaggcccaactggaaggcattattgcaccaaagaag 
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>HSPC23 2 • 

CC^CGGCGAGCGGACGGCTC^TTTACCSGGGTCTCCCGC^GGGCCAGAGTCGTGGCTTAC^GaAfSA 
CaCGAAATGTGGTCTGAGGGACGATATGAATATGAAAGAATC^ 

AGTCATCCCAGTGATGAATCTGGTTATAGATGGACAAGAGACGATCATTCTGC^GCAGGCmACCT 
GAATACAGGGACATGAGAGATGGCTTTAGAAGAAAAAGTTTCTACTCTTCCCATTATGCGA.GAGAG 

CGGTCTGCI^AAAAGGGACAAXACTT^ 

TTCCATCAGTCTC^CAiAGAAATAAAGAGAGGGCTGTCCAGTCTTTGAAA^ 
XCACCCTCRAGTGGraCKflCAOTTTC 

AAGGAACTTGGTGAGGCTGGAAGCAAGTGGGCTGCTGAAAAG^ 

ttgcctgaaatttctgagtatgaggc'gggatccac^^ 

CCTGAGTCAAACAC^CACATGGGATAGAACTATTTGAAGATAGTCAGCTAACCACTCGCTCTAAA 
GCAATAGGATCAAAAM!CAAAGAGATTGAAC3U3GTT^ 

gtggtgaaaatgctcmtgaaaaagatcct^^ 
aatttacatgaaataggtgagcggtgtgttgaagaactcaagc^ 

tccactcaagattttggagagcctttt 

>HYPA 

C^CGCCGGCGGAGCAGTCTGAGCCCGACGATGAGGGCGGGGACGGGAGCTGAGCGTGGAGGCCTC 
ATGATGGGGCACCCTGGCATGCATTATGCCCCAATGGGAATGC^CCCTATGGGTCAGAGAGCGAAT 
ATGCCTCCTGTAGCTCATGGAATGATGCCGCAGATGATGCCCCCTATGGGAGGGCC^CCAA,TGGGA - 

caaatgcctggaatgatgtcgtcagtaatgcctggaatgatgatgtctcatatgtctgaggcttcc 

ATGGAGCCTGCCTTACCGCCAGGAGTAAATAGTATGGATGTAGCAGCAGGTACAGCATCTGGTGCA 

AAATCAATGTGGACTGAACATAAATCACCTGATGGAAGGACTTACTACTACIAAC^CTGAA^CC^U^ 

CaGTCTACCTGGGAGAAACGAGATGATCTTAAAACACCTGCTGAGCAACTCTTATCTAAATGCCCC 

TGGARGGAATACAAATCAGATTCTGGAAAGCCTTACTATTATAATTCTGAAACAAAAGAATCTCGC 

TGGGCCAAACCIAAAGAACTTGAGGATCTTGAAGGATACCAGAA^ 

ATTACAAAATCAA&CCTGCATGCAATGATCAA&GCTGA 

Aci^CATCAACAGCTCCAGTCCCTACAACAGAAATTCCGACCACAA^^ 

GA&Q{a<3CAGCrGCTQTTOTTGCAGCAGCA 

AATGCTTCCACT'TCTGCTTCTAATACTGTGAGTGGAACTGTTCf^ 

acttccattgttgctactgttgtagataatgag^ 

CAACTTACTAGTACCCCTGCTATTCAGGATCAAAGTGTGGAAGTATCCAGTAATACTGGAC3AAGAA 

acatctaagcaagaaactgtagctgattttactcx:ciaaaaaag^^ 

AAGAAAACATACACTTGGAATACAAAGGAAGAGGCAAAG^ 

aagcgggtaGcatcgaatgcttcatgggagcaggctatgaaaatgattattaatgatccacgatac 

agtgctttggcaaagttaagtgaaaaaaagcaaggctttaato 

a^ga&aaaa&aaaaaaaaaaaaaaaa 

>HZFH 

C^CGCCCGCTTCGCCGAGGCCGAGTGCGTGGCCGAGAGCCACCAGCACCTCTCCAAGGAGTCGCTG 

GCGG«^CAAGCCGGCCAACGCCGTCCTGCAC^GGTTCTGAACCAGCTGGAGGAGTTGCTG^ 

GACATGAAGGCGGACGTGACCCGCCTGCCAGCCACGCTGTCCCGAATACCCCCCATCGCAGCCGGC 

CTTCAGATGTCCGAGCG(^GCATCCTCAGCCGGCTGGGCAGCAAGGGCACGGAGCCTCACCCCACA 

CCGGCCTACCCGCCGGGTCCCTACGCTACACCTCCGGGGTACGGGGCGGCCTTGAGCGCCGCACCC 

GTAGGGGCCCTGGCCGCGGCAGGCGCCAATTACAGCCAGATGCCTGCAGGGTCCTTCATCACAGCC 

GCC^ca^CGGCCCTCCAGTGGTl'GTGAAGAAGK^GAAGGAAATGGTGGGGGCATTGGTGTCAGAC 

' gggctggatcggaaggagccccgagccggggaGgtgatctgtatagacgac 
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CTCAAAGAAGKX3AGTCCGCTGGAGGAGCTGGCCCTC 

ACTGAAAACCTGZ^GATGAAGTATACCATATTTT 

GA&GC3AAGGGAATTAjCA^ 

atttggactcttacttaccagcagaattcagct^^ 

AG^CATGGC^SC^ 
>IMPD2 

GACTTTCTC&TTCTCCCTGGGTACATCGACTTCACT^ 

acc^gaaaatcactcttaagacccgactggtttcctct 
atggcc^tagcaatggcgcttacaggcggtattggcttcat^ 

(^GGCCAATGAAGTTCGGAAAGTGAAGAAATAT^^ 

agcggcaaggatcgc<hgggggatgttt^^ 

ATCACAGAC^CAGGCCGGATGC^GAGCCGCTTGGTGGGCA^ 

ctcj^gaggaggaacatgactgtttcttggaagac^^ 

gcccctgcaggcatcac^ctgaaggaggcaaatgaaa^ 

cccattgtaaatgaagatgatgagcttgtggcca^ 

GACTACCCACTAGCCTCCAAAGATGCCA 

GAGGATGACAAGTATAGGCTGGACTTGCTCGCCCAGGCTGGTGTGGATGTAGTGGTTTTGGACTCT 

• TCCC^GGGAAATTCCATCTTCCAGATCAAT^^ 
CAAGTCATTGC^GGC^TGTGGTC^^ 

GGCCTGCGGGTGGGCATGGGAAGTGGCTCCATCTGCATTACGCAGGAAGTGCTGGCCT 

ccccaagcaacagciagtgtacaaggtgtcagagtatgca 

GATGGAGGAATCCAAAATGTGGGTCATATTGCGAAAGCCTT 

atgggctctctcctggctgccacc^cxgaggcccctggtgaatacttcttttccgatgggatgcgg 
cxju^gaaatatcgcggtatgggttctctcgatg 

TATTTCAGTGAAGCTGACAAAA^ 
. TCAATCGACAAATTTGTCCCTTACCT^ 

AAGAGCTTGACdCAAGXCCGAGCC^TGATGTACTCTGGGGAGCTTAAGTTTGAGAAGAGAACGTCC 
TCAGCCCAGGTGGAAGGTGGCGTCCATAGCCTCCATTCGTATGAGAAGCGGCT^ 

>KPNA2 " 
. (^TTGGGCACTCACTAACATTGCTTCTGGGACAT 

(3£CATCCCAGCATTdATTTCTCTO 

GCTCTAGGAAACATTGCAGGTGATGGCTCAGTGTTCCGAGACTTGGTTATTAAGTACGGTGCAGTT 

GACCCACTGTTGGCTCTCCTTGCAGTTCCTGATATGTCATCTTTAGCATGTGGCTACTTACGTAAT 

CTTACCTGGACACTTTCTAATCTTTGCCGCAACAAGAATCCTGCACCCCCGATAGATGCTGTTGAG 

CAGATTCTTCCTACCTTAGTTCGGCTCCTC 

1?GGGCTATTTCCTACCTTACTGATGGTCCAAA^ 

GTGCCCCAACTTGTGAAGCTTCTAGGAGCTTGTGAATTGCCAATTGTGACTCCTGCCCTAAGAGGC 

• ATAGGGAATATTGTCACTGGTACAGATGAACAGACT 
GTCTTTCCCAGCCTGCTCACCAACC CCAAAACTAACAT TCAGAAGGAAGCTACGTGGACAATGTCA 
AAGATCACAGCCGGCCGCCA^ACGAGATAGAGCAAGTTGTGAATC^TGGATXAGTCGCATTCCTT 
GTCAGTGTTCTCTGTAAGGCAGATTTTAAGACACAAAAGGAAGCTGTGTGGGCCGTGACCAACTAT 

' ACCAGTGGTGGAACAGTTGAACAGATTGTGTACCTTGTTCACTGTGGCATAATAGAACCGTTGATG 

• AACCTCTTAACTGCAAAAGATACCAAGATTAT^ 
CAGGCTGCTGAGAAACTAGGTO&AACTGAGAAACT^ 
GACAAAATTGAAGCTCTAC&AAACCATGAAAM 

GAGAAGTATTTCTCTGTAGAGGAAGAGGAAGATCAAAACGTTGTACCAGAAACTACCTCTGAAGG 
TACACTTTCGAAGTTCAGGATGGGGCTCCTGGGACCTTTAACTTT 
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ttggcagctgtgggctt;^ 

GACGAGGTGATC^GCTGCTTCTGGaWUVTTT 

C^GATTCTGXCAGTGXXTGGTGATAXTGCCCXXGCXAXTGGAGGAGA^^ 
CTXGTAXTQAAlACTCXXCAGClAGGCCTC 

TTCATTGACCACATTOCTGGAGATGAGGATC^^ 

ATAGGGGACTXAXGTACAGraXXTGGK^GGAXGTACTGAAATT^ 

CAXG^XTGTTAACTGAAGGGCGGAGATCGAA 

ACAAAAGAACXGAGGAAACTGAAGAACCAAGCT 

>Ku70 

AAJG^CCQGACCTTTAATAC^ 

ATCTATGGGAGTCGTCAGATTATACTGGAGAAAGAGGAAA(^GAAGAGCTAAAACGGTTTGATGA^ 
CCAGGTTTGAXGCXCATGGGTTXCAAGCCGXXGGXACXGCXGAAGAAACA^ 

TCCCTGTTCGTGTACCC^GAGGAGTqGCTGGTGATTGGGAGCTC^CCCTGTTCAGTGCTCTGCTC 
ATCAAGTGTCTGGAGAAGGAGGTXG<^GC^TXGXGCAG^^ 

TATTTTGTGGCTTTGGTGGC^CAGGAAGAAGAGTTGGATGACCAGAAAATTCAGGTGACTCCTCC^ 

GGCTTCCAGCTGGTCTTTTTACCCTTTGCTGATGATAAAAGGAAGATGCCCTTTACTGAA2\AAATC 

ATGGCAACTCCAGAGCAGGTGGGCAAGATGAAGGCTATCGTTGAGAAGCTTCGC 

AGTGACAGCTTTG^GAACCCCGTGCTGC^GCAGCACTTCAGGAA 

TTGATGGAGCCGGAACAAGCAGTGGACCTG^ 

gck:tccttggtggatgagtttaaggagcttgtttacccaccagattac^ 

ACCAAGAGAAAACACGATAATGAAGGTTCTGGAAGCAAAAGGCCCAA 
GAGCTGAAG&CCCA(^TC^GC^GGGTACGCTGOT 
XGCCGC^CXXACGGGCXGAAGAGTGGGCTG^GAAGCA^ 
TTCCAGGAC 
>LUC7B1 ■ 
GTC(^CG"CGGTCGCG^ 
' QA&AAAATAGGAAAACTCCTXGCXAAAGCCGA^ 
CAGAkGATTCTTATGGAAGTGGAAAAAGTTCGTG^ 
AATTCCATGCCTGCATCCAGTTTTCAfiC^ 

ATTCAGATCCGAGAGAAGGTTGATCAGTTGAGGAAAACTGTCGCTGAAAAGCAGGAGAAGAGAA^ 

CAGGATCGCXXGAGGAGGAGAGAGGAGAGGGAACGGGAGGAGCGXCXGAGCAGGAGGTCGGGAXCA 

AGAACCAGAGATCGCAGGAGGTCACGCTCCCGGGATCGGGGTCGGAGGCGGTCAAGATCmCCTCC 

C.GAGAGC GACGGAAATTGTCCCGGTCCCGGTCC CGAGATAGACATCGGCGCCACCGCAGCCGTTCC 

CGGAGCCACAGCCGGGGACATCGTCGGGCTTCCCGGGACCGAAGTGCGAAATAGAAGTTCT 

GAGCGGGCATCCAGAGAGGAGTCCTGGGAGAGCGGGCGGAGCGAGCGAGGGCCCGCGGACTGGAGG 

CTTGA(^GCTCCAACGGGAAGATGGCTTCACGGAGGTCAGAAGAGAAGGAGGCCG 

?>MAGBH1 _ 
GCATCCTTCCCTAGGACTGCTGTAAGCTTTGAGCCTCTAGCAGGAGACATGGCTCGGGGACG2^AAG 

AGXCGGCGCCGCCGXAAXGCGAGAGCCGCAGAAGAGAACCGCAAC 

GAGGCCXCCGAGACCCCXAXGGCCGCCXCTGTGGTAGCGAGCACCCCCGAAGACGACCTGAGCGGC 
CCCGAGGAAGACCCGAGCACTCCAGAGGAGGCCTCTAGCACCCCTGAAGAAGCCTCGAGCACTGCC 
CAAGCACAAAAGCCTTCAGTGCCCCGGAGCAATTTTCAGGGCACCAAGAAAAGTCTCCTGATGTCT 
ATAXTAGCGCXCAXCTXC^XCAXGGGCAACAGCGCCAAGGAAGCXCXGCT 

AAGXTAGGAAXGCAGC CTGGACGTCAGCACAGCATCTTT GGAGATCCGAAGAAGAXCGXCACAGAA 

gagxttgtgcgcagagggtaccxgatxxaxaaaccggxgccccgtagcagxccggxggagtatgag 
xtcttctgggggccccgagcac^cgxggaatcgagcaaactgaa^^^ 
gxxcstaaccgaxgctcxaaagacxggcctxgtaaxtatgactgggaxtcggacgatgatgcagag 
gxtgaggcxatccxcaaxtcaggtggtaggggtxaxxc'cgccccx 
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C&GCGGCGGAGCTTCGCCC^CCGCTGTA^^ 

atcccggtgatcatcgagcgctacaagggtgag^ 

T^GTCCCGCiACCATGTCZ^ 

CCCACGCAGGC CTTCTTCCTGCTGGTGAACC^GC^C^GCATGGTGAGTGTGTG CACGCCCA.'XCGCG 
>mHAPl 

ccgaaagagcaggtgcagagcggtgcgggagacgggac^^ 
cccacgacccagcctgcagttggtcccgctccot^ 

cagggaaccgggtccggacaaaaatcaggatcccg^ ccatg . 

atcattggtgattcggacgcacc^tggacccgct^^ 

actggcttgggcactggaaaggccgagggaatgt^ 

c ccggcgtgtccggcc ctgagcgtgcggcgtttattcgagac^tgcaqgaagcgttgtgxcctaat 

ccaccacccacgaagaagatcaccgaagatgatgtc^^ 

gaacgggacctgaacacagccgcck2ggatcggggagtccctg 

gaggagaataatakgc^ggaaaccatgct^ 

caggtgaacctgcgagatgaccttctt^ 

gacgaggaagacga.ggaagagggcgaagag<^ggaacgagaaggacagag 

cacgac(^cccctatggtgccccc^gcc^caccct^ 

ctggaaa.ccctgg^gcagaagctca.ggcttctggaggakga 

tcccaccttgacaacctggaggacgaagagcagatgc.t 

gaagccagccagcagatgggkgagctatcggaagtc^ 

cagcagaaagagatcactcagctgkiaggcc 

ggogcc cagacggagaaactgcagc&gatgctc 

cgagctggctcctacatgcaggattatgggagcaggcgtcgtgaccgcc^ggaggatgggaagagt 

(^tcgccagcgctcctccatgcccgcaggctctgtc^ 

g(^cttccaagtttcccagaga.c^ 

gac cct gcgtatttcatggagagac gtgacactcact gc agggaggggcgtaagaagg ag'cagagg 
gcgatgccacccccaccggctca 

>mp53 

GTCACCGAGACCCCTGGGCCAGTGGCCCCTGCCCCAGCCACTCC^TGGCCCCTGTC^TCTTTl'GT^ 

ccttcrc2^aaa^cttaccagggc2vacta 
• aagtctgttatgtgc^cgtactctcgtcccgtc^taagctattctggcagctggcgaagacgtgg 
cctgtgcagttgtgggtcagcgccacacctccagctgggagccgtgtccgcgccatggcgatgtag 
aagaagtcacaggacatgacggaggtcgtgagacgctgcccccaccatgagcgctgctccgatggt 
gatggcctggctcctccccagcatcttatccgggtggaaggaaatttg'tatcccgagtatctggaa- 
gacaggcagacttttcgccacagcgtggtggtaccttatgagccacccgaggccggctctgagtat 

ACCACCATCCACTACJUIGTACATGTGT^^ 

cttaccatcatcacactggaagactccagtggga^ 

-gtttgtgcctgccctgggagagaccgccgtacagaagaagaaaatttcdgcaaaaaggaagtcctt 
tgcccr^aactgcccccagggagcgcaaagagagcgctgcccacctgcacaagcgcctctcccccg 
caaaagaaaa^ccacttgatggagagtatttc^cgctc^gatccgcgggcgtaaacgcttcgag 
atgttccgggagctgaatgaggccttagagttaa^ggatgcccatgctacagaggagtctggagac 
agcagggc tcactccagctacctgaagaccaagaagggc gagtctacttcccgccataaaaaaaca 

ATGGTCAAGAAAGTGGGGCCTGACTCAGAC 
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>NAG4 

CCTCCTQAGAAGGCTC^ 
GAAQCTTTGAATC^ 

OkCTCA^AATCAAAAOT^ 
C&TGACTTG^ 

GGCT^GGGAGAGAGAGAGA^^ 

aaagaaAataaa^^ 

GAGCAGGA*^ 
AGTCAGTGCG^ 
GATCGC^TTGTAGGA*^^ 
TCTGGAGTG^TACTTTG^^ 

TiXaZ^arrATGGGcScCTA 

gatgattctgattta2vtctattcaaccta 

catgact^ttgggcAcgtgcc^agattAtgg 

ACayiAAGGAG.GGCA!i?TCCAGGACCCTA 

ACTAG(^dACTTGACACAGC£^ 

T'CCAGTACTCSUU^CAGGCTCA 

TTTGACTCTG&AG&AG^ 

CCCTCATACAGAGAAATGCATGTTGCTG^^ 
GTAACTCCyU3GT(&TATCGt^ 

CCCGTCATGGAAAACAAGTTTGTGGATTTGACAGAAG 
GCTGAGTGTGGACCTGGTGGAAGT 

Figure 6 (continued) 



WO 2004/113566 



35/51 



PCT/EP2004/006617 



CTCTCTCCCCTGTCCTCTCTCTC^ 
ACCATGAGTTCCTTCAGCTACGAGCCGTAC^^ 

CCCCGGGTGCACATCTCCAGCGTGCGCAGCQ<^TACAGCACCGCACGCTGAGCTTACTCCA.GCTAC 
TCGGCGCCGGTGTCTTCCTCGCTGTCCGTGCGCCGCAGCTACTCCTCCAGCTCTGGATCGTTGATG 

CO^G2j:TI3GAGAACCI£GA£^^ 
ACGdAGGAGA&G^GmdCTCCAG^ 

GAGCTGGAGCAGCAGAACAAGGTCCTGGAAGCCGAGCTGCTGGTGCTGCGCCAGAAGCACTCCGAG 
CCATCCCGCTTCCGGGGGCTGTACGAGCAGGAGATCCGCGACCTGCGCCTGGCGGCGGAAGATGCC 

accaacgagaagcaggcgctccagggcgagcgcGaagggctggaggagaccctgcgcaacctggag 
gcgcgctatgaagaggaggtgCtgagccgcgaggacgccgagggccggctgatggaagcgcgcaaa 
ggcgccgacgaggcggcgctcgctcgcgccgagctcgagaagcgcatcgacagcttgatgqacgaa 
atgtcttttgtgaagaaagtgcacgaagaggagAtcgccgaactgcaggcgcagatccagtacgcg 

CAGATCTCCGTGGAGATGGACGTGACCAAGCCCGACCTTTCCGCCGCGCTCAAGGACATCCGCGCG 
CAGTACGAGAAGCTGGCCGCCAAGAACkTGCAGAACGCTGAGGA^^ 

GTGCTGACCGAGAGCGCCGCCIAAGAA.CACCGACGCCGTGCGCGCCGCCAAGGACGAGGTGTCCGAG 
^GCCGTCGTCTGCTCAAGGCCAAGACCCTGGAAATCGAAGCATC 

^GAAGCAGCXGCAGGAGCTGGAGGACAAGCAGAACGCCGACATCAGCGCTATGCAGGACACGATC 

AACAAATTAGAAAAIGAATTGAGGACCACAAAGAGTGAAATGGCA^ 

' GACCTCOTCAACGTGAAGATGGCTTTGGATATTGAGATTGCAGCTTACAGGAAACTCTTGGAAGGG 
GAGGAGACCCGACTCAGTTTC^CCAGCGTGGGAAGCATAACCAGTGGCXACTCCCAGAGCTCCCAG 
GTCTTTGGCCGATCTGCCTACGGCGGTTTACAGACCAGCTCCTATCTGATGTCCACCCGCTCCTTC 
CCGTCCTACTACACCAGCCATGTCCAAGAGGAGCAGAT'CGAAGTGGaGGAAACCATTGAGGCTGCC 

• AAGGCTGAGGAAGCCAAGGATGAGCCCCCCTCTGAAGGAGAAGCCGAGGAGGAGGAGAAGC^CAAG 
GAAGAGGCCGAGGAAGAGGAGGCaGCTGAAGAGGAAGAAGCTGCCAAGGAAGAGTCTGAAGAAGCA 
AAAGAAGAAGAAGAAGGAGGTGAAGGTQAAGAAGGAGAGGAAACCA^ 

. AAGAAAGTTGAAGGTGCTGGGGAGGAACAAGCAGCTAAGAAGAAAGAT 

>p53 . 

ATGGAGGAGCCGCAGTCAGATCCTAGCGTCGAGCCCCCTCTGAGTCAGGAAACATTTTCAGACCTA 
TGGAAACTACTTCCTGAAAACAACGTTCTGTCCCCCTTGCCGTCCCAAGCAATGGATGATTTGATG 
CTGTCCCCGGACGATATTGAACAATGGTTCACTGAAGACCCAGGTCCAGATGAAGCTCCCAGAATG 
CCAGAGGCTGCTCCCCCCGTGGCCCCTGCACCAGCAGCTCCTACACCGGCGGCCCCTGCAC CAGCC , 
CCCTCCTGGCCCCTGTCATCTTCTGTCCCTTCCCAGAAAACCTACCAGGGCAGCTACGGTTTCCGT 
CTGGGCTTCTTGCATTCTGGGACAGCCAAGTCTGTGACTTGCACGTACTCCCCTGCCCTCAACAAG 
ATGTTTTGCCAACTGGCCAAGACCTGCCCTGTGCAGCTGTGGGTTGATTCCACACGCCCGCCCGGC- ■ 
ACCCGCGTCCGGGCCATGGCCATCTACAAGCAGTCACAGCACATGACGGAGGTTGTGAGGCGCTGC 
CCCCACCATGAGCGCTGCTCAGATAGCGATGGTCTGGCCCCTCCTCAGCATCTTATCCGAGTGGAA 
GGAAATTTGCGTGTGGAGTATTTGGATGACAGAAACACTTTTCGACATAGTGTGGTGGTGCCCTAT 
■ GAGCCC^CTGAGGTTGGCTCTGACTGTACCACCATCCACTACAACTACATGTGTAACAGTTCCTGC 
ATGGGCGGCATGAACCGGAGGCCCATCCTCACGATCATCACACTGGAAGACTCCAGTGGTAATCTA 
CTGGGACGGAACAGCTTTGAGGTGCGTGTTTGTGCCTGTCCTGGGAGAGACCGGCGCACAGAGGAA 
GAGAATCTCCGCAAGAAAGGGGAGCCTCACCACGAGCTGCCCCCAGGGAGCACTAAGCGAGCACTG 
CCCAACAACACCAGCTCCTCTCCCCAGGO^GAAGAAACCACTGGATGGAGAATATTTC^C 
CAGATCCGTGGGCGTGAGCGCTTCGAGATGTOCCGAGAGCTGAAXGAGGCCTTGGAACTCAAGGAT 

GCCCAGXSCTGGGAAGGAGCCAGGGGGGAGCAGGGCTCACrC^ 
CAGTCTACGTCCCGCCATAAAAAACTCATGTTCAAGACAGAAGGGCCTGACTCAGAC 
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TGC^C^GCCACGGCCGGGGGCGTCTTTCAG^G^^ 
^^GACCGGGAAGGTTTCTTTACCAACGGTTT 

ACATAC^TGTGGciGTCGGC^GAGCTGGTAGAGT 

CATGG AG GCGGATTGAATAAGAAGGCATACTCAATGG c aaaatact tgagagactgtgggtt c 
ctStGGAGG^ 

gtgggccc^agtaagagtggactgaagcacgagctcgtcaccagggccctccagctggtgcagttt 
g^ctgtagccctgagctgttca&gaagatca^^ 

gccggcgctgtgcccaggactcc^^ • 
tacttaaacggactgggacggttgcccgcc^gaccctc^^ ■ 

ccgttctttaatatgctggatgagct 

cttc^ggagagcccgtgcatcttcgcattgacgccaagacaggtg^^ 

gaactgcagc ccggagttaaagccgtgcaggtcgtc ctgagaatctgttactcagacaccagctgc 

cctc^ggaggaccagtacccgcc^ • 

ggctactacccctccaataagcccggggtggagcccaaga^ 

•cacctcmgtacctgtcct 

tactcggtggccco?gtack:tg6tgcggc^gctgacctcatcggagctgctgcagaggctgaagacc 

attggggtaaagcacccggagctctgcaag cttgat cctgag 

agcga.gatcgccaccaccggtgtgcgggtgtccctcatctgtccgctggtgaagatgcgggtctcc 
gtgccctgccgggc^igagacctgcgccc^cctgc^gtggttcgacgccgtcttctacctgcagatg 
. aacgagaagaagcccacctggatgtgccccgtgtgcgacaagccagccccctacgaccagctc^ 
atcgkcgggctcctctcgaagatcctgagcgagtgtgaggacgccgacgagatcgagtacctggtg 

gtgctgggcccctcggacgccaatgggctgctgcccgcccgcagcgtcaacgggagcggtgccctg 

ggcagcacgggtggcggcggcccggtgggcagcatggagaat 

gacctgacgctggacagctcatcgtcctcgg^ 
. gaggacgaagaggggccccggcccaagcggcgctgccccttccagaagggg.ctggtgccggcctgc 
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CTC^TGGAGGCC^AAAAC^TGGTGAT^^ 
GTGC^CGGAGTAAGAGTGGACTGAAGCAC^^ 

gactgtagccctgagctgttcaag&agatgaaot^ 

TCGGAGCCTGCCCCAC^GCCGCACC 

G<^e€G£^^e€^^ 

TACTTAA^GC^CTG^ 

ccgttctttaatatgctggatgagctgctgaagccc^ 
gttcaggaig&gcccg^gca^ 

<^CTGCAGCCCGG&GTTAAAG 
CCTGAGGAGGACCAGTACCCGCCCAACA 
GGCTACTACCCCTCCAATAAGCCCGGGGTGGAGCCC^^ 
CkCCTCATGTACCTGTCCTC^ 

TACTCGGTGGCCCTGTACCTGGTGCGGCAGCTGACCTC^ 

ATTGGGGTAAAGCACCCGGAGCTGTGCAAGGCACTGGTCAAGGAGAAGCTGCGGCTTGATCCTGAC 

AGCGAGATCGCCACC^CGGGTGTGCGGGTGTCCCTCA'fCTGTCCGCTGGTGAAGATGCGGGTCTCC 

GTGGCCTGCCGGGCAGAGACCTGCGCCCACGTGCAGTGCTTCGACGCCGTCTTCTACCTGCAGATG 

AACGAGAAGAAGCCCACCTGGATGTGCCCCGTGTGCGACAAGCCAGCCCCCTAGGACCAGCTCATC 

ATCGACGGGCTCCTCTCGAAGATCCTGAGC^^ 

GACGGCTCGTGGTG^CCGATCCGCGCCGAAAAGGAGOT 

GTGCTGGGCCCCTCGGACGCCAATGGGCTCCTGCCCGCCCCCAGCGTCAACGGGAGCGGTG^CCTG 
GGCAGCACGGGTGGCGGCGGCCCGGTGGGCAGCMGGAGA^T 

GACCTCACGGTGGAGAGCTCATCGTCCTCGGAGGATGAGGAGGAGGAGGAAGAGGAGGAGGAAGAC 

• GAGGACGAAGAGGGGCCCCGGCCCAAGCGCCGGTGCCCCTTGCAGAAGGGCCTGGTGCCGGCCTGC 
>PLIP 

* GGGGAGATAATCGAGGGCTGCCGCCTACCCGTGCTGCGGG^^ 
CCCCTGGCCGAGATCCTGAGCGTGAAGGACATC^GTGGCCGGAAGCTTrTCTACGTCCATTACATT 

■ GACTTCAACAAACGTCTGGATGAATGGGTGACGCM • 
CCCAAGAAAGAGGCCAAC^CCCCCACTAAGAACGGACTTGCTGGGTCCCGTCCTG^ 
AGAGAGGTGAAACGGAAGGTGGAGGTGGTTT CACCACX1AACTC CAGTGCCCAGCGAGACAGCCCCG 
GCCTCGGTTTTTCCCCAGAATGGAGCCGCCCGTAGGGCAGTGGCAGCCCAGCCAGGACGGAAGCGA 
AAATCC^TTGTTTGGGCACTGATGAGG^ 
CGCA^GACTGGCAGCCTGGTGTCTGATCGAAGCCACGAC^ 

GAGTGCATTGAGCTGGGCCGGCACCGCeTCAAGCCGTGGTACTTCTCCCCGTACCCACAGGAACTC 
ACC^CATTGCCTGTCCTCTACCTGTGCGAGTTCTGCCTC^GTACGGCCGTAGTCTCIAAGTGTCTT 
C&GCGTCATTTGACCAAGTGTGACCTACGACATCCTC 

ATCT.C CTTCTTTGAGATTGATGGACGTAAGAACAAGAGTTATTCCCAGAAC CTGTGTCTTTTGGCC 

AAGTGTTTCCTTGACCATAAGA<^CTGT 

GAGTATGACTGTAAGGGCTTCCACATCX3TO 

TACAATGTGGCCTGCATCCTAACCCTGCCTCCCTACCAGCGCCGGGGCTACGGCAAGCTGGTGATC 
GAGTTCmG£TATGAACTCTCCAAAGTG(^GGG^^ 

CT!TGGCCTCCTATCCTATCGAAGCTACTGGTCCC^GACCATCCTGGAGATCCTGATGGGGCTGAAG 
T CGGAGAG C GGGGAGAGGC cacagat CAC CATCAATGAGATTAGTGAAAT CAC CAGCAT CAAGAAG 

c^ggatgtcatctccactctgcagtacctcaatctc^ 
acactgtc^gaggacatcgtggatggcc^ 

tccaagtgtctgcactt(^ctcccaaggacxggagcaagagggggaagtgg 
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>PTItf 

TTGAGTCA^GGCM<^TCAGGTTCCC 
GCAGAGG^AAATCCAAAGTGGAGAGAGGGGAAGA^ 

GCTGTGGATACTGCTG^ 

C^AG&ATGGCAGTGK3AGTGTGTC 

ACTCaOACTGGAGCTGAOT^^ 

AAGAAGCAATTTGGCGCGGAQTGCAAATACGAGTTCC^^ 

GCCCTGAAGACCAGAACTGGAAGTCTGAAGCGAGCCCTGCACA^^ 

ACCATCTCCAAGCCCTGTGGCAAACTGAC^ 

J^SGAAGG(^GAMC^GGAGAAGATGCTGGAT 

>PTPK- 

agtaactacat<^tgctgctcttatgga^^ 

taccctctgccaaacactgtaa^gacttctgg^gattagtgtatgattatggctgtacctgcl^ot 
gtgatgttaaacgakgtcgacttgtcccagggctgccct 

CGATATGGCCCCATCCAAGTGGAATGT^^ 

TTTAGGATATGCAATCTAACAAGACCACAC^ 

GGATGGGCTTCTGATCGAGj^GTGCCTGGATC 

gaaaagrggcaggaggaatgcgaggaaggggaaggc cggacgattatc cactgcctaaatggtggc 

gggcgaagtggcatgttctgtgctataggc^tc 

gatgttttcgatgcagtaaagagact^ 

taccgttt ctgctatgatgtagctttggagtac ctggaatca.tct 

>SETBD1 

• AAGGCCTCX^CCTGAGGACTAGGCATCAAGGAT^ 
ACTGACGACCGAAACAAGAl'GTCAGTAGTTACTGAAAG 
CCTGTGAAGCCTGAAGGACTTCGCCGCC CAC CT^ 
ATGGCTTCTGCTGAGTCCAACCCTGATGA^ 
GAAAGTGGGACClAGCCGAAAGCCC^CTGCT^ 

ATCGAGACCATATCCTGTGGGTCTGAAGGGGATGACTTTGAGGAGAAGAAGAACAXGACTGGTCCA 

ATGJ^GCGTCAAGTGGCAGTAAAATCAACCCGA^^^ 

ATTAAATC^CCAACATGGCCTCTGTGGACAAGGGG^^ 

CAATTCTATGATGGCGAGGAGTCTTGCTACATCATTGATGCCAAGCTTGAAGGCAACCTGGGCCGC 
■ TACCTCAACCACAGTTGCAGCCCCAACCTC 
CGCTrCCCCTGGGTGGCCTTCTTTGCC^GClAAAAGAATCCGGGCTGGGAGAGAACOT 
TACAACTACGAGGTGGGCAGTGTGGAAGGCAAGGA 
AOAGGACGTCTTCTT 
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>SS3GL3 • 

GTTGCAGAAATTCTOT^ 




TTAGACTAT(&tiA<^C^^ 

CTCAATGGAGTtfTCG&CC&dCT^^ 

TGCTGTCGTGGTCTCTATGACCT 

ATCATTACkTTAAGG^^ 

TTCTT^CCATT^ 

>SUMO-2 

CGGCCCGOXXiACMTTO^ • 

GCTTCeCGCGdGCCGCCTCCCCG^ 

GTOAAGACAG&GAATGk^^ 

aagatcaagaggg&cacgccgctgag^ 
atc^ggcaqatcagm 

ga(^t(^ggaggaggacaccatcgacgtgttccag^ 
ct(^c^gggcagagtttc . * 

>SUMO-3 

CCCTCGTCClACCGCTGCCGCCTCCTTCTT^ 
CGGACCrGGTA^CTCTTTTGT^ 

AAGaCCAAGGWU^GT^^ * 
GGTTCTGTC^TOCAGTTTAAGATTAA 

(^CACACGTQC&GAGTTGGAA^ 

GGTGTCTAG 

>TAL1~ 

AGCTt^CCCGTOA^ 

GTGC^CGAC&C^ 

TCCClmTCCTGGCCGCAG^ 

GGCCGGAAC^TGGGCGGGTG^ 

TTTCteAGC^^TACTAMGAA^ 

TITC^TAAAGATGCGATGGtT^^ 

AGGAAGGACCGJ^TTCTTATAAAGGTGTCAT CAACGTGGGAAGGAATTCAGGCTGGAAAGGAGC 

GAGQAGC^GCACGGCATCCACTGCAACAfGACGTTACT^ 

GCCGA<^CGGGTOTGACG^^ 

AaCGACAAGA^TCCTATG^^ 
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>TCPG . . 

CAGACTGACATTGAGATTACACGAGA 

ATCQQCS^GCTCTQTGAJQQACIAT^ 

TCA^TTTAGCTCAGC^^ 

GkC^TAATCGCATTGCT 

ATCACTGAC^ 

TCGGAAGTAGAACGCAACCTCCAGCIATGCCATGC^^ 
CTGGTGCCAGGGGGTGGGGCCTCCGAGATGGCTC 
ATGACTGGTGTGGAACAATGGCC^TAGAGGGCT^ 
CTGATCCAGAACTGTGGGGCCAGC^CCATCCGTC 
. GAGAACTGTGAGACOTGGGGTGTAAATGG7GAGACG^ 
ATATGGGAGCCATTGGCTGTGAAGC^^ 
CTGCGAATTGATGApVTCGTO 
GCTCCTGATGCTGGCCAGGAG 
>VIM * 

TCCCCGCGCCAGAGACGOAGCCGCGCTCCCA 

TCCGGGAGCCAGTCCGCGCCACCGCCGCCGCCCAGG^ 

AGGTCCGTGTCCTCGTCCTCCTACCGCAGGATC 

TCCAGCCGGAGCTACGTGACTACGTCCACCCG<^ 

ACCAGCCGC^GCCTCTACGCCTCGl'CCCCGGGCGGCGTGT^ 

CTGCGGAdCAGCGTGCCCGGGGTGCGGCTCCTGGAGGACTCGGTGGACTTCTCGCTGGCCGACGCC 
ATC&ACACCGAGTOCAAGAACA^ 

ttcgcca^ctac^tcgacaaggtgcgcttcctgga^ 

gagcagctcaagggccaaggcaagtcgcgcctgggggacctctac^ 

•cgccggcaggtggaccagctaagc^vacgacaaac^ 

gaggac^tcatgcgcctccjgggagaaattc^^ 

accctccaatctttc^gacaggatgot^ 

gtggaatctttgcaac^gagattgcct 

CAGGC TC&GATTCAGGAAC^GCATGT^ CTCACGGCT 
GCCCTGCGTGACGTACGTCAGCAATATGAAAGTGTC^ 
. TGGTACAAATCCAAGTTTGCTGACCTCT 
GCAAAGCAGGAGTC CACTGAGTACCGGAGACAGGTGCAGTCC CTCACCTGTGAAGTGGAX'GCCCTT 
AAAGGAACCAATGAGTCCCTGGiAACGCCAGATGCGT^ 
GCTAACTACCAAGACACTATTGGCCGCCTG^ 

CGTGACCTTCGTGAATACCAAGAC CTGCTCAATGTTAAGATGGCCCTTGACAT'T GAGATTGCCACC 

TACAGGAAGCTOCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCT^ 

AACCTGAGGGAAACTAATCTGGATTCACT^ 

ATTAAGACGGTTGAAACTAGAG^TGGACAGGTTATCAACGAAACTTC 
GAA 
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>VIMo 

aatgcgtctctck3cacgtcttgaccttgaacg^^ 

ttgaagaaactcc^cgaagaggaaatc caggagc tgcaggct CAGATTCAGGAACAGCATG-T c caa 

ATCGATGTGGATGTTTCCAAGCCTGACCT^^ 
AGTQIQQCTGC^ 

CAGGTGCAGTCCCTCACCTGTGAAGTGGATGCCCT 
ATGCGTGAAATGGAAGAGAACTTTGCCGTT 

CAGGATGAGATTC^GAATATGAAGGAGGAAATGGCTCGTCACGTTCGTGAATACCAAGACCTGCTC 

AATGTTAAGATGGCCCTTGAC^TT^^ 

AGG^TTFCTCTGCCTCTTCCAAACT^^ 

CCTCTGGTTGATACCCACTCiyy^ 

GTTATCAACGAAACTTCTCAGCATCACGATGACCTTGAA 
>ZHX1 

GAAC^VAAC^TAAATGATCTGACTTTTG^ 

GAATCTACAGAAGTTTCTT CTT CGGGAATATCTATCAGTAAAACTCCTATCATGLAAAATGATGAAA 
AATAAAjGTGGAAAATAAAC 

GAGAATGA^TCAAACClAGACCGTGAAGAAATTG^ 
AATACAAGTACTTCCATTGTAAACAG;^^ 

GGAGTTCTTCCTGGATTGGCACAGGTGATAAGTGCTGTATCTGCTC^ 

CCGAAAGTCTTAATCCCTGTTAATAGCATTCC^ 

TTACTTAACACCTACAACAAGTTCCCTT^ 

GC71AAATATACAGAGGAACAGAT CAAGATATGGTTTT CAGCCCAACGTTTAAAACATGGTGTTAGT 

TGGACTCCCGAGC^GTA^GGAGGCAAGAAGGAAACAATX 

CAGACCATAACTGTTATTCCTAC 

ACATGCCAAATAGTTGGTCAGC CTGGTCTGGTCCTTACTGAAGTGGCTGGAACAAAGACCTTGC C A 
GTTACAGCACCTATAGC^ 

GTACCTGCTGCTCAGCCTACTGCAGAAACAAAGCCA^ 

GTCAAACATGA^CTGCATTGGTAAACCCTGATTCATTTGGCATTCGGGCAA^ 
CAACTGGCAGAATTAAAAGTTAGCTACCTTAAAAATCAGTTTCCCCAT 
CTTATGAAZUITAACAGGCCTGACGA^^ 
CAGAGAAATTCAAAGAGTAATCAGTGCTTAC^TC 

GACTCCAGTGATGAAAC CACGGAAXCCCCAACTGTTGGTACTGCACAGCCrAAGCAATGC T?GGAAT 
- CCTTTTCCTGACTTTACTC CCGAAAAGTTTAAAGAGAAAACT CTTCAG 
GCAAGTOTTCTCAACAGCTCTGTACTTACAGATGAAGAATTAAATAGGTTAAGGGC^ 
CTTACCAGAAGAGAAATCGATGCTTGGTTTACAGAGAAGAAGAAATGAAAAGCTT 
AAAATGGAAATAGATGAAAGTAATGCAGGTAGTTCGAAAGAAGAAGCTG(^GAAAGTTCTCCTGCA 
GATGAATCTGGTGCACCTAAGTCAGGGAGTACAGGO^GATATGTAAAAAAACACCTGAGCAGCTG 
C^CArGCTTAAGAGTGCATTTGTCCGGACACAGTGGGCATCACCAGAAGAGTATGACAAGTTGGCC 
AAkGAAAGCGGGCTTGCTAGAACAGACATAGTT^ 
• AATGGAAACTTGA^TGGTAOTACTACTATC^ 

TCCOTTAGGAAAAGAGGGAGAGC^AGACCCAAAGGACGGGGAAGAGGAAGACCGCGTGGGC'GGCCT 
AGAGGAAGCAAAAGAATTAAC^CTGGGACAGGGGACCM 

GCAATACTTAAGGATTATTACCTGAAGCACAAGTXTCTTAATGAGCAAGACCTTGATGAACTTGTT 
AACAAATCACATATGGGCTATGAGC^GGTCAGAGAGTGGTTTGC 

TTAGGTATAGAATTATTTGAGGAAAATGAGGAGGAAGATGAAGTTATTGATGACCAGGAAGAGGAT 
GAAGAAGAAACAGATGATAGTCJACACOT^ 
' TCAGATGAC 
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>2NF33B 

TGTTATGAA.TGTGGGAAAACC 

AAATCA!TACCTAACAAAACATAATA<^ 

1ATGAATG?AATCAGTC^ 

CATACACAAGAAAAGCCTTATAAATGT^ 

ATTTTACATGAG&GAAAGCAC^CGGGOT^ 

AGtCAO^TCATCA^ 

G^TGTGGAAAAATCTTTTACCGTJ^ 

^GCCCTATGAATGTAACACATG<^^ 

AGAACAGACATAGGAGAAAAACGTTATGAA 

Figure 6 (continued) 
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Nucleotide sequence data (fasta format) 

>GDF9 

CCTGT GGGAGAAGAGGCT GCT GAGGAT GGGAGAT CTT CCCAT CACCGT CACCGCAGAGGT CAGGAAACTGTCAGT 
TCTGAATTGAAGAAGCCCTTGGGCCCAGCTTCCTTCAATCTGAGTGAATACTTCAGACAATTTCTTCTTCCCCAA 
AATGAGTGTGAGCTCCATGACTTTAGACTTAGCTTTAGTCAGCTGAAGTGGGACAACTGGATTGTGGCTCCGCAC 
AGGTACAACCCTCGATAC T GTAAAGGGGACTGTCCAAGGGCAGT TGGACATC GGTAT GGCTCT CCAGT TCACACC 
ATGGTACAGAACATCAT CTATGAGAAGCT GGACTCCT CAGTGCCAAGACCGT CATGT GTACC T GCCAAATACAGC 
C CC TT GAGT GT T T TGACCATT GAGCCCGATGGCT CAAT TGCCT ATAAAGAGT ACGAAGATAT GATAGC T ACAAAG 
TGCACCTGTCGT ' 

>GAPD ' 

CCTGTTCGACAGTCAGCCGCATCTTCTTTTGCGTCGCCAGCCGAGCCACATCGCTCAGACACCATGGGGAAGGTG 
AAGGTCGGAGTCAACGGATTTGGTCGTATTGGGCGCCTGGTCACCAGGGCTGCTTTTAACTCTGGTAAAGTGGAT 
AT T GT TGCCAT CAAT GACCCCT TCAT T GACCTCAAC T ACATGGT T TACATGT TCCAATAT GATTCCAC CCATGGC 
AAATT CCAT GGCACCGTCAAGGCT GAGAACGGGAAGCT TGTCAT CAAT GGAAATCCCATCAC C ATCT T CCAGGAG 
CGAGATCCCTCCAAAATCAAGTGGGGCGATGCTGGCGCTGAGTACGTCGTGGAGTCCACTGGCGTCTTCACCACC 
ATGGAGAAGGCTGGGGCTCATTTGCAGGGGGGAGCCAAAAGGGTCATCATCTCTGCCCCCTCTGCTGATGCCCCC 
ATGTTCGTCATGGGTGTGAACCATGAGAAGTATGACAACAGCCTCAAGATCATCAGCAATGCCTCCTGCACCACC 
AACTGCTTAGCACCCCTGGCCAAGGTCATCCATGACAACTTTGGTATCGTGGAAGGACTCATGAGCACAGTCCAT. 
GCCATCACTGCCACCCAGAAGACTGTGGATGGCCCCTCCGGGAAACTGTGGCGTGATGGCCGCGGGGCTCTCCAG 
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